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Clp-protease as target for herbicides 

The present invention relates to Clp-protease, which, when absent, brings about re- 
duced growth and chlorotic leaves as target for herbicides. For this purpose, novel nu- 
5 cleic acid sequences encompassing SEQ ID NO:3, SEQ ID NO:1 1 and SEQ ID NO: 17 
and functional equivalents of SEQ ID NO:3, SEQ ID NO:1i and SEQ ID NO: 17 are 
provided. Moreover, the present invention relates to the use of Clp-protease in a 
method for identifying compounds with herbicidal or growth-regulatory activity, and to 
the use of the compounds identified by this method as herbicides or growth regulators. 

10 

The basic principle of identifying herbicides via the inhibition of a defined target is 
known (for example US 5,187,071, WO 98/33925, WO 00/77185). In general, there is a 
great demand for the detection of enzymes which might constitute hovel targets for 
herbicides. The reasons are resistance problems which occur with herbicidal active 
1 5 ingredients which act on known targets, and the ongoing endeavor to identify novel , 
herbicidal active ingredients which are distinguished by as wide as possible a spectrum 
of action, ecological and toxicological acceptability and/or low application rates. 

In practice, the detection of novel targets entails great difficulties since the inhibition of 
20 an enzyme which forms part of a metabolic pathway frequently has no further effect on 
the growth of the plant. This may be attributed to the fact that the plant switches to al- 
ternative metabolic pathways whose existence is not known or that the inhibited en- 
zyme is not limiting for the metabolic pathway. Furthermore, plant genomes are distin- 
guished by a high degree of functional redundancy. Functionally equivalent enzymes 
25 are found more frequently in gene families in the Arabidopsis thaliana genome than in 
insects or mammals (Nature, 2000, 408(681 4):796-81 5). This hypothesis is confirmed 
experimentally by the fact that comprehensive gene knock-out programs by T-DNA or 
transposon insertion into Arabidopsis yielded fewer manifested phenotypes to date 
than expected (Curr. Op. Plant Biol. 4, 2001, pp.1 11-1 17). 

30 

It is an object of the present invention to identify novel targets which are essential for 
the growth of plants or whose inhibition leads to reduced plant growth, and to provide 
methods which are suitable for identifying herbicidally active and/or growth-regulatory 
compounds. 

35 

We have found that this object is achieved by the use of nuclear encoded Clp-protease 
in a method for identifying herbicides. 

Further terms used in the description are now defined at this point. 

40 

"Affinity tag": this refers to a peptide or polypeptide whose coding nucleic acid se- 
quence can be fused to the nucleic acid sequence according to the invention either 
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directly or by means of a linker, using customary i^i^lte^CB^^rttag^^ 
serves for the isolation, concentration and/or selective purification of the recombinant 
target protein by means of affinity chromatography from total cell extracts. The above- 
mentioned linker can advantageously contain a protease cleavage site (for example for 
5 thrombin or factor Xa), whereby the affinity tag can be cleaved from the target protein 
when required. Examples of common affinity tags are the tt His tag n , for example from 
Qiagen, Hilden, "Strep tag", the "Myc tag" (Invitrogen, Carlsberg), the tag from New 
England Biolabs which consists of a chitin-binding domain and an inteine, the maltose- 
binding protein (pMal) from New England Biolabs, and what is known as the CBD tag 
10 from Novagen. In this context, the affinity tag can be attached to the 5' or the 3 1 end of 
the coding nucleic acid sequence with the sequence encoding the target protein. 

"Activity of nuclear encoded CIp-protease": the term activity describes the ability of an 
enzyme to convert a substrate into a product. The enzymatic activity can be deter- 

15 mined in what is known as an activity assay via the increase in the product, the de- 
crease in the substrate (or starting material) or the decrease in a specific cofactor, or 
via a combination of at least two of the abovementioned parameters, as a function of a 
defined period of time. "Activity of nuclear encoded dp-protease" describes here the 
ability of an enzyme to catalyze the hydrolysis of peptides of maximal five amino acids 

20 in vitro. 

"Expression cassette": an expression cassette contains a nucleic add sequence ac- 
cording to the invention linked operably to at least one genetic control element, such as 
a promoter, and, advantageously, a further control element, such as a terminator. The 

25 nucleic acid sequence of the expression cassette can be for example a genomic or 
complementary DNA sequence or an RNA sequence, and their semisynthetic or fully 
synthetic analogs. These sequences can exist in linear or circular form, extrachromo- 
somally or integrated into the genome. The nucleic acid sequences in question, can be 
synthesized or obtained naturally or contain a mixture of synthetic and natural DNA 

30 components, or else consist of various heterologous gene segments of various organ- 
isms. 

Artificial nucleic acid sequences are also suitable in this context as long as they make 
possible the expression, in a cell or an organism, of a polypeptide with the enzymatic 
35 . activity of a nuclear encoded Clp Protease, preferably with the biological activity of a a 
nuclear encoded Clp Protease, which polypeptide is encoded by a nucleic acid se- 
quence according to the invention. For example, synthetic nucleotide sequences can 
be generated which have been optimized with regard to the codon usage of the organ- 
isms to be transformed. 

40 

All of the abovementioned nucleotide sequences can be generated from the nucleotide 
units by chemical synthesis in the manner known per se, for example by fragment con- 
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densation of individual overlapping complementary nucleotide units of the double helix. 
Oligonucleotides can be synthesized chemically for example in the manner known per 
se using the phosphoamid'rte method (Voet, Voet, 2nd Edition, Wiley Press New York, 
pp. 896-897). When preparing an expression cassette, various DNA fragments can be 
5 manipulated in such a way that a nucleotide sequence with the correct direction of 
reading and the correct reading frame is obtained. The nucleic acid fragments are 
linked with each other via general cloning techniques as are described, for example, in 
T. Maniatis, E.F. Fritsch and J. Sambrook, "Molecular Cloning: A Laboratory Manual", 
Cold Spring Harbor Laboratory, Cold Spring Harbor, NY (1989), in T.J. Silhavy, M.L 
10 Berman and L.W. Enquist, Experiments with Gene Fusions, Cold Spring Harbor Labo- 
ratory, Cold Spring Harbor, NY (1984) and in Ausubel, F.M. et al. t "Current Protocols in 
Molecular Biology", Greene Publishing Assoc. and Wiley-lnterscience (1 994). 

"Operable linkage" or "functional linkage": an operable, or functional, linkage is under- 
15 stood as meaning the sequential arrangement of regulatory sequences or genetic con- 
trol elements in such a way that each of the regulatory sequences, or each of the ge- 
netic control elements, can fulfill its intended function when the coding sequence is 
expressed. 

20 "Functional equivalents" describe, in the present context, nucleic acid sequences which 
hybridize under standard conditions with the nucleic acid sequence SEQ ID NO:1, SEQ 
ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:11, SEQ ID NO:3, 
SEQ ID NO:13, SEQ ID NO:15 or SEQ ID NO:17 or parts of the aforementioned nu- 
cleic acid sequences and which are capable of bringing about the expression, in a cell 

25 or an organism, of a polypeptide with the activity of Clp protease. 

To carry out the hybridization, it is advantageous to use short oligonucleotides with a 
length of approximately 10-50 bp, preferably 15-40 bp, for example of the conserved or 
other regions, which can be determined in the manner with which the skilled worker is 

30 familiar by comparisons with other related genes. However, longer fragments of the 
nucleic acids according to the invention with a length of 100-500 bp, or the complete 
sequences, may also be used for hybridization. Depending on the nucleic 
acid/oligonucleotide used, the length of the fragment or the complete sequence, or de- 
pending on which type of nucleic acid, i.e. DNA or RNA, is being used for the hybridiza- 

35 tion, these standard conditions vary. Thus, for example, the melting temperatures for 
DNA: DNA hybrids are approximately 10°C lower than those of DNA: RNA hybrids of the 
same length. 

Standard hybridization conditions are to be understood as meaning, depending on the 
40 nucleic acid, for example temperatures of between 42 and 58oC in an aqueous buffer 
solution with a concentration of between 0.1 and 5 x SSC (1 X SSC = 0.15 M NaCI, 15 
mM sodium citrate, pH 7.2) or additionally in the presence of 50% formamide, such as, 
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for example, 42°C in 5 x SSC, 50% formamide. The hybridization conditions for 
DNA: DNA hybrids are advantageously 0.1 x SSC and temperatures of between ap- 
proximately 20°C and 65°C f preferably between approximately 30°C and 45°C. In the 
case of DNA:RNA hybrids, the hybridization conditions are advantageously 0.1 x SSC 
5 and temperatures of between approximately 30 °C and 65 °C, preferably between ap- 
proximately 45°C and 55 °C. These hybridization temperatures which have been stated 
are melting temperature values which have been calculated by way of example for a 
nucleic add with a length of approx. 100 nucleotides and aG + C content of 50% in the 
absence of formamide. The experimental conditions for DNA hybridisation are de- 

10 scribed in relevant textbooks of genetics such as, for example, in Sambrook et al., "Mo-* 
lecular Cloning 0 , Cold Spring Harbor Laboratory, 1989, and can be calculated using 
formulae with which the skilled worker is familiar, for example as a function of the 
length of nucleic acids, the type of the hybrids or the G + C content. The skilled worker 
will find further information on hybridization in the following textbooks: Ausubel et al. 

15 (eds), 1985, "Current Protocols in Molecular Biology", John Wiley & Sons, New York; 
Hames and Higgins (eds.), 1985, "Nucleic Acids Hybridization: A Practical Approach 0 , 
IRL Press at Oxford University Press, Oxford; Brown (ed.), 1991, Essential Molecular 
Biology: A Practical Approach, IRL Press at Oxford University Press, Oxford. 

20 A functional equivalent of SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, 
SEQ ID NO:9, SEQ ID NO:11, SEQ ID NO:3, SEQ ID NO:13, SEQ ID NO:15 or SEQ 
ID NO: 17 can be furthermore defined by the degree of homology or identity with SEQ 
ID NO:1, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:11, 
SEQ ID NO:3, SEQ ID NO:13, SEQ ID NO:15 or SEQ ID NO:17, respectively, and can 

25 furthermore comprise also natural or artificial mutations of the aforementioned nucleic - 
acid sequences which encode a polypeptide with the activity of a nuclear encoded Clp- 
protease. 

The present invention also encompasses, for example, those nucleotide sequences 
30 which are obtained by modification of the SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:5, 
SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:11, SEQ ID NO:3, SEQ ID NO:13, SEQ ID 
NO:15orSEQIDNO:17. 

For example, such modifications can be generated by techniques with which the skilled 
35 worker is familiar, such as "Site Directed Mutagenesis", "Error Prone PCR", "DNA- 
shuffling" (Nature 370, 1994, pp. 389-391) or "Staggered Extension Process" (Nature 
Biotechnol. 16, 1998, pp.258-261). The aim of such a modification can be, for example, 
the insertion of further cleavage sites for restriction enzymes, the removal of DNA in 
order to truncate the sequence, the substitution of nucleotides to optimize the codons, 
40 or the addition of further sequences. Proteins which are encoded via modified nucleic 
acid sequences must retain the desired function despite a deviating nucleic acid se- 
quence. 
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The term "functional equivalents" can also relate to the amino acid sequence encoded 
by the nucleic acid sequence in question. In this case, the term "functional equivalent" 
describes a protein whose amino acid sequence has a defined percentage of identity or 
5 homology with SEQ ID NO:3. 

Functional equivalents thus also comprise naturally occurring variants of the herein- 
described sequences and artificial nucleic acid sequences, for example those which 
have been obtained by chemical synthesis and which are adapted to the codon usage, 
10 and also the amino acid sequences derived from them. 

"Genetic control sequence" describes sequences which have an effect on the transcrip- 
tion and, if appropriate, translation of the nucleic acids according to the invention in 
prokaryotic or eukaryotic organisms. Examples thereof are promoters, terminators or 

15 what are known as "enhancer" sequences. In addition to these control sequences, or 
instead of these sequences, the natural regulation of these sequences may still be pre- 
sent before the actual structural genes and may, if appropriate, have been genetically 
modified in such a way that the natural regulation has been switched off and the ex- 
pression of the target gene has been modified, that is to say increased or reduced. The 

20 choice of the control sequence depends on the host organism or starting organism. 
Genetic control sequences furthermore also comprise the 5'-untranslated region, in- 
trons or the noncoding 3-region of genes. Control sequences are furthermore under- 
stood as meaning those which make possible homologous recombination or insertion 
into the genome of a host organism or which permit removal from the genome. Genetic 

25 control sequences also comprise further promoters, promoter elements or minimal 

promoters, and sequences which have an effect on the chromatin structure (for exam- 
ple matrix attachment regions (MARs)), which can modify the expression-governing 
properties. Thus, genetic control sequences may bring about for example the additional 
dependence of the tissue-specific expression on certain stress factors. Such elements 

30 have been described, for example, for water stress, abscisic acid (Lam E and Chua 
NH, J Biol Chem 1991; 266(26): 17131 -17135), high- and low-temperature stress 
(Plant Cell 1994, (6): 251-264) and heat stress (Molecular & General Genetics, 1989, 
217(2-3): 246-53). 

35 "Homology" between two nucleic acid sequences or polypeptide sequences is defined 
by the identity of the nucleic acid sequence/polypeptide sequence over in each case 
the entire sequence length, which is calculated by alignment with the aid of the pro- 
gram algorithm GAP according to Needleman and Wunsch 1970, J. Mol. Biol. 48; 443- 
453) setting the following parameters for polypeptides: 

40 

Gap Weight: 8 Length Weight: 2 

Average Match: 2,91 2 Average Mismatch:-2,003 



WO 2005/054283 



6 



PCT/EP2004/013555 



and the following parameters for nucleic acids: 

Gap Weight: 50 Length Weight: 3 

5 Average Match: 10.000 Average Mismatch: 0.000 

In the following text, the term identity is also used synonymously with the term "homol- 
ogy". 

10 "Mutations" of nucleic or amino acid sequences comprise substitutions, additions, dele- 
tions, inversions or insertions of one or more nucleotide residues, which may also bring 
about changes in the corresponding amino acid sequence of the target protein by sub- 
stitution, insertion or deletion of one or more amino acids, although the functional prop- 
erties of the target proteins are, overall, essentially retained. 

15 

"Natural genetic environment" means the natural chromosomal locus in the organism of 
origin, in the case of a genomic library, the natural genetic environment of the nucleic 
acid sequence is preferably retained at least in part. The environment flanks the nucleic 
acid sequence at least at the 5 - or 3-side and has a sequence length of at least 50 bp, 
20 preferably at least 100 bp, especially preferably at least 500 bp, very especially pref- 
erably at least 1000 bp, and most preferably at least 5000 bp. 

"Plants" for the purposes of the invention are plant cells, plant tissues, plant organs, or 
intact plants, such as seeds, tubers, flowers, pollen, fruits, seedlings, roots, leaves, 
25 stems or other plant parts. Moreover, the term plants is understood as meaning propar 
gation material such as seeds, fruits, seedlings, slips, tubers, cuttings or root stocks. 

"Recombinant DNA" describes a combination of DNA sequences which can be gener- 
ated by recombinant DNA technology. 

30 

"Recombinant DNA technology": generally known techniques for fusing DNA se- 
quences (for example described in Sambrook et ah, 1989, Cold Spring Harbor, NY, 
Cold Spring Harbor Laboratory Press). 

35 "Replication origins" ensure the multiplication of the expression cassettes or vectors 
according to the invention in microorganisms and yeasts, for example the pBR322 ori 
or the P15A ori in E. coli (Sambrook et aL "Molecular Cloning. A Laboratory Manual", 
2nd ed. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 1989) and the 
ARS1 ori in yeast (Nucleic Acids Research, 2000, 28(10): 2060-2068). 

40 

"Reporter genes" encode readily quantifiable proteins. The transformation efficacy or 
the expression site or timing can be assessed by means of these genes via growth 
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assay, fluorescence assay, chemoluminescence assay, bioluminescence assay or re- 
sistance assay or via a photometric measurement (intrinsic color) or enzyme activity. 
Very especially preferred in this context are reporter proteins (Schenborn E, Groskreutz 
D. Mol Biotechnol. 1999; 13(1):29-44) such as the "green fluorescent protein" (GFP) 
5 (Gerdes HH and Kaether C, FEBS Lett. 1996; 389(1 ):44-47; Chui WL et al., Curr Biol 
1996, 6:325-330; Leffel SM et al., Biotechniques. 23(5):912-8, 1997), chloramphenicol 
acetyl transferase, a luciferase (Giacomin, Plant Sci 1996, 116:59-72; Scikantha, J 
Bad 1996, 178:121; Millar et al., Plant Mol Biol Rep 1992 10:324-414), and luciferase 
genes, in general p-galactosidase or p-glucuronidase (Jefferson et al., EMBO J. 1987, 
10 6, 3901-3907) or the Ura3 gene. 

"Selection markers" confer resistance to antibiotics or other toxic compounds: exam- 
ples which may be mentioned in this context are the neomycin phosphotransferase 
gene, which confers resistance to the aminoglycoside antibiotics neomycin (G 418), 

15 kanamycin, paromycin (Deshayes A et al., EMBO J. 4 (1985) 2731-2737), the sul gene, 
which encodes a mutated dihydropteroate synthase (Guerineau F et al., Plant Mol Biol. 
1990; 15(1): 127-1 36), the hygromycin B phosphotransferase gene (Gen Bank Acces- 
sion NO: K 01 193) and the shble resistance gene, which confers resistance to the 
bleomycin antibiotics such as zeocin. Further examples of selection marker genes are 

20 genes which confer resistance to 2-deoxyglucose-6-phosphate (WO 98/45456) or 

phosphinothricin and the like, or those which confer a resistance to antimetabolites, for 
example the dhfr gene (Reiss, Plant Physiol. (Life Sci. Adv.) 13 (1994) 142-149). Ex- 
amples of other genes which are suitable are trpB or hisD (Hartman SC and Mulligan 
RC, Proc Natl Acad Sci U S A. 85 (1988) 8047-8051). Another suitable gene is the 

25 mannose phosphate isomerase gene (WO 94/20627), the ODC (ornithine decarboxy- 
lase) gene (McConlogue, 1987 in: Current Communications in Molecular Biology, Cold 
Spring Harbor Laboratory, Ed.) or the Aspergillus terreus deaminase (Tamura K et al., 
Biosci Biotechnol Biochem. 59 (1995) 2336-2338). 

30 "Transformation" describes a process for introducing heterologous DNA into a pro- or 
eukaryotic cell. The term transformed cell describes not only the product of the trans- 
formation process per se, but also al! of the transgenic progeny of the transgenic or- 
ganism generated by the transformation. 

35 "Target/target protein": a polypeptide encoded via the nucleic acid sequence according 
to the invention (this term is defined herein below), which may take the form of an en- 
zyme in the traditional sense or, for example, of a structural protein, a protein relevant 
for developmental processes, regulatory protein such as transcription factors, kinases, 
phosphatases, receptors, channel subunits, transport proteins, regulatory subunits 

40 which confer substrate or activity regulation to an enzyme complex. All of the targets or 
sites of action share the characteristic that their functional presence is essential for 
survival or normal development and growth. 
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Transgenic 0 : referring to a nucleic acid sequence, an expression cassette or a vector 
comprising a nucleic acid sequence according to the invention or an organism trans- 
formed with the abovementioned nucleic acid sequence, expression cassette or vector, 
5 the term transgenic describes all those constructs which have been generated by ge- 
netic engineering methods in which either the nucleic acid sequence of the target pro- 
tein or a genetic control sequence linked operably to the nucleic acid sequence of the 
target protein or a combination of the abovementioned possibilities are not in their natu- 
ral genetic environment or have been modified by recombinant methods. In this con- 
10 text, the modification can be achieved, for example, by mutating one or more nucleo- 
tide residues of the nucleic acid sequence in question. 

Intracellular protein degradation and its regulation are important for several processes 
like recycling of aminoacids, prevention of protein agglomeration and regulation of sig- 
15 naling processes (e.g. signalling of phytohormones). Cytosolic Proteins that are to be 
degraded are ubiquitinylated at the N-terminus and delivered to the proteasome which 
is established as a complex of a large number of protein components in eucaroytes. 
Roughly 12% of the genes in Arabidopsis thaliana are encoding proteins envolved in 
protein degradation by the ubiquitin pathway. 

20 

The stroma of plant chloroplasts contains a unique ubiquitin-independent, ATP- 
dependent protease consisting of two mayor components, a serine-type protease 
(CIpP) and an ATPase (CIpC, -D, -X) both of which are encoded by enzyme families in 
Arabidopsis thaliana (for details on the differing nomenclatures in literature see Adam 

25 et al. 2001 , Plant Physiology 125, pp.1912-18). Six unique CIpP Isoforms (ClpP1-6), are 
nuclear encoded in Arabidopsis and at least one CIpP ist encoded in the plastid ge- 
nome (pCIpP) all of which carry the three conserved active site aminoacids characteris- 
tical for a catalytic triade of serine proteases. Some sequences ofmRNA for putative 
ATP-dependent protease proteolytic sub units CIpP are disclosed in Nakabayashi et al. 

30 ( Plant Cell Physiol 40: 504-514, 1999) and Kotani et al. (DNA Research 4, 291-300, 
1997). A subunit of Clp protease, which does not show any own activbity of a protease 
is disclosed in WO 2003008440 A. Further Clp gene from algae, tobacco or cyanobao 
terium are depicted in Huang et al. (Mol. Gen. Genet 244, 151-159, 1994), Shikanai et 
al. (Plant Cell Physiol. 42, 261-273, 2001) and Clarke et al. (Plant Molecular Biology 

35 37, 791-801, 1998) respectively. 

Further three nuclear encoded ClpP-lsoforms which miss the conserved amino acid 
residues of the catalytic triade are found in Arabidopsis (ClpR1, ClpR3, ClpR4). The 
catalytic activity of CIpR-type ClpP-lsoforms has not been shown so far. 

40 At least one CIpP and two CIpX proteins may be targeted to mitochondria in Arabidop- 
sis as deduced from N-terminal signal sequences. CIpP Proteases are conserved in 
bacteria. The CIpP protease in E.coli was formerly known as "protease Ti n . A knock out 
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of the protease Ti was shown to be not lethal. E.coli CIpP is assambled as a complex 
of 14 CIpP subunits in two heptameric rings. Co-immunoprecipitation suggests com- 
plexes of similar sizes and an ATP-dependet interaction of CIpP and CIpC subunits in 
Chloroplasts of Arabidopsis thaliana (Halperin et al. 2001, Planta 213, pp. 614-619). 
5 Furthermore, a 350kDa CIpP complex has been identified in Arabidopsis chloroplasts 
using blue native gel electrophoresis. The complex presumingly containins most of the 
known CIpP Isoenzymes (Benoit-Peltier et al. 2001, Journal of Biological Chemistry 
276, pp. 16348-16327). Consequently the complexity and redundancy of plant Clp pro- 
teases is high and detailed information about composition of the clp complex and the 
10 functional role of its subunits remain to be clarified. Particularly the role of CIpP redun- 
dancy is still unclear. 

The CIpP subunit is capable of activly hydrolysing peptides of max. 5 aminoacids in 
vitro. ClpA,B,C subunits constitute ATP-hydrolysing chaperones which unfold target- 

15 proteins and present them for hydrolysis to CIpP (Porankiewicz et al. 1999, Molecular 
Microbiology 32, 449-458). Involvement of CIpP in the degradation of the cytochrome 
b6f complex an PSII has been decribed in Chlamydomonas (Majeran et al. 2001 , Plant 
Physiology 23+, pp. 421-433). Functional properties of CIpR-type Clp-Proteases as 
well as the CIpP like Proeases are yet to be determined. 

20 Surprisingly, it has been found within the scope of the present invention that plants in 
which a Clp protease was reduced in a selective manner have phenotypes which are 
comparable with phenotypes generated by herbicide application. Drastic growth retar- 
dation and damage such as were observed. 

25 The present invention relates to the use of a polypeptide, which has the activity of nu- 
clear encoded Clp-protease in a method for identifying herbicides, preferably of a poly- 
peptide, which has the activity of nuclear encoded Clp-protease, which is 

a) selected from the group consisting of ClpP1 -protease, ClpP2-protease, ClpP3- 
30 protease, ClpP4-protease and ClpP6-protease; or 

b) selected from the group consisting of ClpR1 -protease, ClpR3-protease, CIpR4- 
protease; or 

35 c) ClpP-like-protease, wherein more preferably 

a) the ClpP1 -protease is encoded by a nucleic acid sequence which comprises: 



40 



0 



a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 
NO:1,or 
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ii) a nucleic acid sequence which, owing to the degeneracy of the genetic 
code, can be deduced from the amino acid sequence shown in SEQ ID 
NO:2 by back translating, or 

5 iii) a functional equivalent of nucleic acid sequence shown in SEQ ID NO:1 

which has an identity with SEQ ID NO:1 of has at least 50%; or 

iv) a functional equivalent of the nucleic acid sequence shown in SEQ ID 
NO:1, which is encoded by an amino acid sequence that has at least an 
1 0 identity of 50% with the SEQ ID NO:2; 

b) the ClpP2-protease encoded by a nucleic acid sequence which comprises: 

i) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 
15 NO:3, or 

ii) a nucleic acid sequence which, owing to the degeneracy of the genetic 
code, can be deduced from the amino acid sequence shown in SEQ ID 
NO:4 by back translating, or 



20 



iii) a functional equivalent of nucleic acid sequence shown in SEQ ID NO:3 
which has an identity with SEQ ID NO:3 of has at least 50%; or 



iv) a functional equivalent of the nucleic acid sequence shown in SEQ ID 
25 NO:3, which is encoded by an amino add sequence that has at least an 

identity of 50% with the SEQ ID NO:4; 

c) the ClpP3-protease is encoded by a nucleic acid sequence which comprises: 

30 i) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 

NO:5, or 

ii) a nucleic acid sequence which, owing to the degeneracy of the genetic 
code, can be deduced from the amino acid sequence shown in SEQ ID 

35 NO:6 by back translating, or 

iii) a functional equivalent of nucleic acid sequence shown in SEQ ID NO:5 
which has an identity with SEQ ID NO:5 of has at least 50%;or 



40 



iv) 



a functional equivalent of the nucleic add sequence shown in SEQ ID 
NO:5, which is encoded by an amino acid sequence that has at least an 
identity of 50% with the SEQ ID NO:6; . 
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the ClpP4-protease is encoded by a nucleic acid sequence which comprises: 

i) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 
NO:7,or 

ii) a nucleic acid sequence which, owing to the degeneracy of the genetic 
code, can be deduced from the amino acid sequence shown in SEQ ID 
NO:8 by back translating, or 

iii) a functional equivalent of nucleic acid sequence shown in SEQ ID NO:7 
which has an identity with SEQ ID NO:7 of has at least 50%; or 

iv) a functional equivalent of the nucleic acid sequence shown in SEQ ID 
NO:7, which is encoded by an amino acid sequence that has at least an 
identity of 50% with the SEQ ID NO:8; 

the ClpP6-protease is encoded by a nucleic acid sequence which comprises: 

i) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 
NO:9,or 

ii) a nucleic acid sequence which, owing to the degeneracy of the genetic 
code, can be deduced from the amino acid sequence shown in SEQ ID 
NO: 10 by back translating, or 

iii) a functional equivalent of nucleic acid sequence shown in SEQ ID NO:9 
which has an identity with SEQ ID NO:9 of has at least 50%; or 

iv) a functional equivalent of the nucleic acid sequence shown in SEQ ID 
NO:9, which is encoded by an amino acid sequence that has at least an 
identity of 50% with the SEQ ID NO:10; 

the ClpR1 -protease is encoded by a nucleic acid sequence which comprises: 

i) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 
NO:11,or 

ii) a nucleic acid sequence which, owing to the degeneracy of the genetic 
code, can be deduced from the amino acid sequence shown in SEQ ID 
NO: 12 by back translating, or 
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iii) a functional equivalent of nucleic acid sequence shown in SEQ ID NO:1 1 
which has an identity with SEQ ID NO:1 1 of has at least 50%; or 

iv) a functional equivalent of the nucleic acid sequence shown in SEQ ID 

5 NO:1 1 , which is encoded by an amino acid sequence that has at least an 

identity of 50% with the SEQ ID NO:12; 

g) the ClpR3-protease is encoded by a nucleic acid sequence which comprises: 

10 i) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 

NO:13,or 

ii) a nucleic acid sequence which, owing to the degeneracy of the genetic 
code, can be deduced from the amino acid sequence shown in SEQ ID 

15 NO:14 by back translating, or 

iii) a functional equivalent of nucleic acid sequence shown in SEQ ID NO: 13 
which has an identity with SEQ ID NO:13 of has at least 50%; or 

20 iv) a functional equivalent of the nucleic acid sequence shown in SEQ ID 

NO: 13, which is encoded by an amino acid sequence that has at least an 
identity of 50% with the SEQ ID NO:14; 



25 



35 



h) the ClpR4-protease is encoded by a nucleic acid sequence which comprises: 

i) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 
NO: 15, or 



ii) a nucleic acid sequence which, owing to the degeneracy of the genetic 
30 code, can be deduced from the amino acid sequence shown in SEQ ID 

NO: 1 6 by back translating, or 



iii) a functional equivalent of nucleic acid sequence shown in SEQ ID NO: 15 
which has an identity with SEQ ID NO: 15 of has at least 50%; or 

iv) a functional equivalent of the nucleic acid sequence shown in SEQ ID 
NO: 15, which is encoded by an amino acid sequence that has at least an 
identity of 50% with the SEQ ID NO:16; 



40 i) the 



CIpP like-protease is encoded by a nucleic acid sequence which comprises: 
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i) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 
NO:17, or 



ii) a nucleic acid sequence which, owing to the degeneracy of the genetic 
5 code, can be deduced from the amino acid sequence shown in SEQ ID 

NO:1 8 by back translating, or 



10 



Hi) a functional equivalent of nucleic acid sequence shown in SEQ ID NO: 17 
which has an identity with SEQ ID NO:17 of has at least 50%; or 

iv) a functional equivalent of the nucleic acid sequence shown in SEQ ID 
NO: 17, which is encoded by an amino acid sequence that has at least an 
identity of 50% with the SEQ ID NO: 18; 

15 wherein the sequences b) i-iv, e) i-iv, f) i-iv and are especially preferred 

The term "comprising" in relation to a nucleic acid sequence means that the nucleic 
acid sequence can be flanked by additional nucleic acid sequences that have on the 5' 
end and on the 3' end or on the 5'end or on the 3' end on the end a sequence length of 
at least 1000 bp, preferably at least 500 bp, more preferably at least 250bp, most pref- 

20 erably at least 100bp. 



The functional equivalent according to the invention of SEQ ID NO:1 as described In a) 
iii), which encodes a polypeptide, which has the activity of nuclear encoded Clp- 
protease, and has at least an identity of 50%, 51%, 52%, 53%, 54%, 55%, 56%, 57%, 
25 58%, 59%, 60%, 61%, 62%, 63%, 64% or 65% or preferably of 66%, 67%, 68%, 69%, 
70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78% or 79% more preferably of 80%, 
81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89% or 90% most preferably of 91%, 
92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% with SEQ ID NO:1. 



30 The functional equivalents of the nucleic acid sequence SEQ ID NO:1 set forth in a) iv. 
are encoded by an amino acid sequence, which has the activity of nuclear encoded 
Clp-protease and has at least an identity of 50%, 51%, 52%, 53%, 54%, 55%, 56%, 
57%, 58%, 59%, 60%, 61%, 62%, 63%, 64% or 65% or preferably of 66%, 67%, 68%, 
69%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78% or 79% more preferably of 

35 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89% or 90% most preferably of 
91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% with SEQ ID NO:2. 

The functional equivalent according to the invention of SEQ ID NO:3 as described in b) 
iii), which encodes a polypeptide, which has the activity of nuclear encoded Clp- 
40 protease, and has at least an identity of 50%, 51%, 52%, 53%, 54%, 55%, 56%, 57%, 
58%, 59%, 60%, 61%, 62%, 63%, 64% or 65% or preferably of 66%, 67%, 68%, 69%, 
70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78% or 79% more preferably of 80%, 



WO 2005/054283 PCT/EP2004/0 13555 

14 

81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89% or 90% most preferably of 91%, 
92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% with SEQ ID NO:3. 

An example of a functional equivalent of SEQ ID NO: 3 is the nucleic acid sequence of 
5 Arabidopsis thaliana (Gene Bank Acc. No. AB022327). This sequence is herein incor- 
porated by reference. 

The functional equivalents of the nucleic acid sequence set forth SEQ ID NO:3 in b) iv. 
are encoded by an amino acid sequence, which has the activity of nuclear encoded 
10 Clp-protease and has at least an identity of 50%, 51%, 52%, 53%, 54%, 55%, 56%, 
57%, 58%, 59%, 60%, 61%, 62%, 63%, 64% or 65% or preferably of 66%, 67%, 68%, 
69%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78% or 79% more preferably of 
80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89% or 90% most preferably of 
91 %, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% with SEQ ID NO:4. 

15 

The functional equivalent according to the invention of SEQ ID NO:5 as described in c) 
iii), which encodes a polypeptide, which has the activity of nuclear encoded Clp- 
protease, and has at least an identity of 50%, 51%, 52%, 53%, 54%, 55%, 56%, 57%, 
58%, 59%, 60%, 61%, 62%, 63%, 64% or 65% or preferably of 66%, 67%, 68%, 69%, 
20 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78% or 79% more preferably of 80%, 
81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89% or 90% most preferably of 91%, 
92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% with SEQ ID NO:5. 

The functional equivalents of the nucleic acid sequence SEQ ID NO:5 set forth in c) iv. 

25 are encoded by an amino acid sequence, which has the activity of nuclear encoded 
Clp-protease and has at least an identity of 50%, 51%, 52%, 53%, 54%, 55%, 56%, 
57%, 58%, 59%, 60%, 61 %, 62%, 63%, 64% or 65% or preferably of 66%, 67%, 68%, 
69%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78% or 79% more preferably of 
80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89% or 90% most preferably of 

30 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% with SEQ ID NO:6. 

The functional equivalent according to the invention of SEQ ID NO: 7 as described in d) 
iii), which encodes a polypeptide, which has the activity of nuclear encoded Clp- 
protease, and has at least an identity of 50%, 51%, 52%, 53%, 54%, 55%, 56%, 57%, 
35 58%, 59%, 60%, 61 %, 62%, 63%, 64% or 65% or preferably of 66%, 67%, 68%, 69%, 
70%, 71 %, 72%, 73%, 74%, 75%, 76%, 77%, 78% or 79% more preferably of 80%, 
81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89% or 90% most preferably of 91%, 
92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% with SEQ ID NO:7. 

40 The functional equivalents of the nucleic acid sequence set forth SEQ ID NO:7 in d) iv. 
are encoded by an amino acid sequence, which has the activity of nuclear encoded 
Clp-protease and has at least an identity of 50%, 51%, 52%, 53%, 54%, 55%, 56%, 
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57%, 58%, 59%, 60%, 61%, 62%, 63%, 64% or 65% or preferably of 66%, 67%, 68%, 
69%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78% or 79% more preferably of 
80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89% or 90% most preferably of 
91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% with SEQ ID NO:7. 

5 

The functional equivalent according to the invention of SEQ ID NO:9 as described in e) 
iii), which encodes a polypeptide, which has the activity of nuclear encoded Clp- 
protease, and has at least an identity of 50%, 51%, 52%, 53%, 54%, 55%, 56%, 57%, 
58%, 59%, 60%, 61%, 62%, 63%, 64% or 65% or preferably of 66%, 67%, 68%, 69%, 
10 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78% or 79% more preferably of 80%, 
81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89% or 90% most preferably of 91%,' 
92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% with SEQ ID NO:9. 

The functional equivalents of the nucleic acid sequence set forth SEQ ID NO:9 in e) iv. 

15 are encoded by an amino acid sequence, which has the activity of nuclear encoded 
Clp-protease and has at least an identity of 50%, 51%, 52%, 53%, 54%, 55%, 56%, 
57%, 58%, 59%, 60%, 61%, 62%, 63%, 64% or 65% or preferably of 66%, 67%, 68%, 
69%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78% or 79% more preferably of 
80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89% or 90% most preferably of 

20 91 %, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% with SEQ ID NO: 10. 

The functional equivalent according to the invention of SEQ ID NO:11 as described in f) 
iii), which encodes a polypeptide, which has the activity of nuclear encoded Clp- 
protease, and has at least an identity of 50%, 51%, 52%, 53%, 54%, 55%, 56%, 57%, 
25 58%, 59%, 60%, 61 %, 62%, 63%, 64% or 65% or preferably of 66%, 67%, 68%, 69%, 
70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78% or 79% more preferably of 80%, 
81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89% or 90% most preferably of 91%, 
92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% with SEQ ID NO:1 1 . 

30 The functional equivalents of the nucleic acid sequence SEQ ID NO: 1 1 set forth in f) iv. 
are encoded by an amino acid sequence, which has the activity of nuclear encoded 
Clp-protease and has at least an identity of 50%, 51%, 52%, 53%, 54%, 55%, 56%, 
57%, 58%, 59%, 60%, 61%, 62%, 63%, 64% or 65% or preferably of 66%, 67%, 68%, 
69%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78% or 79% more preferably of 

35 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89% or 90% most preferably of 
91 %, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% with SEQ ID NO:12. 

An example of a functional equivalent of SEQ ID NO: 1 1 is the nucleic acid sequence 
of Arabidopsis thaliana (Gene Bank Acc. No. AB022330). This sequence is herein in- 
40 corporated by reference. 
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The functional equivalent according to the invention of SEQ ID NO: 13 as described in 

g) iii), which encodes a polypeptide, which has the activity of nuclear encoded Clp- 
protease, and has at least an identity of 50%, 51%, 52%, 53%, 54%, 55%, 56%, 57%, 
58%, 59%, 60%, 61%, 62%, 63%, 64% or 65% or preferably of 66%, 67%, 68%, 69%, 

5 70%, 71 %, 72%, 73%, 74%, 75%, 76%, 77%, 78% or 79% more preferably of 80%, 
31%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89% or 90% most preferably of 91%, 
92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% with SEQ ID NO: 13. 

The functional equivalents of the nucleic acid sequence SEQ ID NO: 13 set forth in g) 
10 iv. are encoded by an amino acid sequence, which has the activity of nuclear encoded 
Clp-protease and has at least an identity of 50%, 51%, 52%, 53%, 54%, 55%, 56%, 
57%, 58%, 59%, 60%, 61%, 62%, 63%, 64% or 65% or preferably of 66%, 67%, 68%, 
69%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78% or 79% more preferably of 
80%, 81 %, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89% or 90% most preferably of 
15 91 %, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% with SEQ ID NO:14. 

The functional equivalent according to the invention of SEQ ID NO: 15 as described in 

h) iii), which encodes a polypeptide, which has the activity of nuclear encoded Clp- 
protease, and has at least an identity of 50%, 51%, 52%, 53%, 54%, 55%, 56%, 57%, 

20 58%, 59%, 60%, 61 %, 62%, 63%, 64% or 65% or preferably of 66%, 67%, 68%, 69%, 
70%, 71 %, 72%, 73%, 74%, 75%, 76%, 77%, 78% or 79% more preferably of 80%, 
81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89% or 90% most preferably of 91%, 
92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% with SEQ ID NO:15. 

25 The functional equivalents of the nucleic acid sequence SEQ ID NO:1 5 set forth in h) 
iv. are encoded by an amino acid sequence, which has the activity of nuclear encoded 
Clp-protease and has at least an identity of 50%, 51%, 52%, 53%, 54%, 55%, 56%, 
57%, 58%, 59%, 60%, 61%, 62%, 63%, 64% or 65% or preferably of 66%, 67%, 68%, 
69%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78% or 79% more preferably of 

30 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89% or 90% most preferably Of 
91 %. 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% with SEQ ID NO: 16. 

The functional equivalent according to the invention of SEQ ID NO: 17 as described in i) 
iii), which encodes a polypeptide, which has the activity of nuclear encoded Clp- 
35 protease, and has at least an identity of 50%, 51%, 52%, 53%, 54%, 55%, 56%, 57%, 
58%, 59%, 60%, 61%, 62%, 63%, 64% or 65% or preferably of 66%, 67%, 68%, 69%, 
70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78% or 79% more preferably of 80%, 
81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89% or 90% most preferably of 91%, 
92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% with SEQ ID NO:17. 

40 

The functional equivalents of the nucleic acid sequence SEQ ID NO: 17 set forth in i) iv. 
are encoded by an amino acid sequence, which has the activity of nuclear encoded 
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Clp-protease and has at least an identity of 50%, 51%, 52%, 53%, 54%, 55%, 56%, 
57%, 58%, 59%, 60%, 61%, 62%, 63%, 64% or 65% or preferably of 66%, 67%, 68%, 
69%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78% or 79% more preferably of 
80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89% or 90% most preferably of 
5 91 %, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% with SEQ ID NO:18. 

An example of a functional equivalent of SEQ ID NO: 17 is the nucleic acid sequence 
of Arabidopsis thaliana (Gene Bank Acc. No. AK1 18525). This sequence is herein in- 
corporated by reference. 



Furthermore claimed within the scope of the present invention are plant nucleic acid 
sequence 

I) encoding a ClpP2-protease comprising: 



10 



15 



a) 



a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 
NO:3, or 



b) 



a nucleic acid sequence which, owing to the degeneracy of the genetic 
code, can be deduced from the amino acid sequence shown in SEQ ID 
NO:4 by backtranslating, or 



20 



c) 



a functional equivalent of nucleic acid sequence shown in SEQ ID NO:1 
which has an identity with SEQ ID NO:3 of has at least 66%; or 



25 



a functional equivalent of the nucleic acid sequence shown in SEQ ID 
NO:1 1, which is encoded by an amino acid sequence that has at least an 
identity of 76% with the SEQ ID NO:4; 



30 II) encoding a ClpR1 -protease comprising: 



a) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 
NO:11, or 



35 



b) a nucleic acid sequence which, owing to the degeneracy of the genetic 
code, can be deduced from the amino acid sequence shown in SEQ ID 
NO: 12 by backtranslating, or 



40 



c) a functional equivalent of nucleic acid sequence shown in SEQ ID NO:1 
which has an identity with SEQ ID NO:1 1 of has at least 69%; or 
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d) a functional equivalent of the nucleic acid sequence shown in SEQ ID 
NO:1 1, which is encoded by an amino acid sequence that has at least an 
identity of 71 % with the SEQ ID NO: 1 2; 

5 III) encoding a ClpP-like-protease comprising: 

a) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 
NO: 17, or 

10 b) a nucleic acid sequence which, owing to the degeneracy of the genetic 

code, can be deduced from the amino acid sequence shown in SEQ ID 
NO: 18 by backtranslating, or 

c) a functional equivalent of nucleic acid sequence shown in SEQ ID NO:1 
15 which has an identity with SEQ ID NO: 17 of has at least 67%; or 

d) a functional equivalent of the nucleic acid sequence shown in SEQ ID 
NO: 17, which is encoded by an amino acid sequence that has at least an 
identity of 79% with the SEQ ID NO: 18; 

20 The functional equivalent of SEQ ID NO:3 set forth in I c) has at least an identity of 
66%, 67%, 68%, 69%, 70%, 71%, 72%, 73%, 74%, by preference at least 75%, 76%, 
77%, 78%, 79%, 80%, 81%, 82% or 83%, preferably at least 84%, 85%, 86%, 87%, 
88%, 89%, 90%, 91%, 92% or 93%, especially preferably at least 94%, 95%, 96%, 
97%, 98% or 99% with SEQ ID NO:3. 

25 

The functional equivalents of the nucleic acid sequence SEQ ID NO:3 set forth in I) d) 
are encoded by an amino acid sequence, which has the activity of nuclear encoded 
Clp-protease and has at least an identity of 77%, by preference at least 78%, 79%, 
80%, 81%, 82% or 83%, preferably at least 84%, 85%, 86%, 87%, 88%, 89%, 90%, 
30 91%, 92%, 93%, especially preferably at least 94%, 95%, 96%, 97%, 98%, 99% with 
SEQIDNO:4. 

The functional equivalent of SEQ ID NO:1 1 set forth in II c) has at least an identity of 
69%, 70%, 71%, 72%, 73% or 74%, by preference at least 75%, 76%, 77%, 78%, 79%, 
35 80%, 81 %, 82% or 83%, preferably at least 84%, 85%, 86%, 87%, 88%, 89%, 90%, 
91%, 92% or 93%, especially preferably at least 94%, 95%, 96%, 97%, 98% or 99% 
with SEQ ID NO:11. 

The functional equivalents of the nucleic acid sequence SEQ ID NO:1 1 set forth in II) d) 
40 are encoded by an amino acid sequence, which has the activity of nuclear encoded 

Clp-protease and has at least an identity of 71% by preference at least 72%, 73%,74%, 
75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, preferably at least 84%, 85%, 86%, 
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87%, 88%, 89%, 90%, 91%, 92%, 93%, especially preferably at least 94%, 95%, 96%, 
97%, 98%, 99% with SEQ ID NO: 12. 

The functional equivalent of SEQ ID NO: 17 set forth in I c) has at least an identity of 
5 67%, 68%, 69%, 70%, 71%, 72%, 73%, 74%, by preference at least 75%, 76%, 77%, 
78%, 79%, 80%, 81%, 82% or 83%, preferably at least 84%, 85%, 86%, 87%, 88%, 
89%, 90%, 91%, 92% or 93%, especially preferably at least 94%, 95%, 96%, 97%, 
98% or 99% with SEQ ID NO: 17. 

10 The functional equivalents of the nucleic acid sequence SEQ ID NO: 17 set forth in I) d) 
are encoded by an amino acid sequence, which has the activity of nuclear encoded 
CIp-protease and has at least an identity of 79%, by preference at least 79%, 80%, 
81%, 82% or 83%, preferably at least 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 
92%, 93%, especially preferably at least 94%, 95%, 96%, 97%, 98%, 99% with SEQ ID 

15 NO:18. 

The polypeptides encoded by the abovementioned nucleic acid sequences according 
to I c)-d), II c)-d) and III c)-d) are likewise claimed. The functional equivalents as de- 
scribed in c) and d) are distinguished by the same functionality, i.e. they have the activ- 
20 ity of a clp-protease. 

The nucleic acid sequences I c)-d), II c)-d) and III c)-d) are hereinbelow termed NCLP- 
sequences. 

25 The term "nucleic acid sequences according to the invention" which is used hereinbe- 
low refers to nucleic acid sequences encoding a polypeptide, which has the activity of 
nuclear encoded Clp-protease in a method for identifying herbicides, preferably of a 
polypeptide, which has the activity of nuclear encoded Clp-protease, which is 

30 a) selected from the group consisting of ClpP1 -protease, ClpP2-protease, ClpP3- 
protease, ClpP4-protease and ClpP6-protease; or 

b) selected from the group consisting of ClpR1 -protease, ClpR3-protease, ClpR4- 
protease; or 

35 

c) ClpP-like-protease, wherein more preferably 

a) the ClpP1 -protease is encoded by a nucleic acid sequence which comprises: 



40 



i) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 
NO:1, or 
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ii) a nucleic acid sequence which, owing to the degeneracy of the genetic 
code, can be deduced from the amino acid sequence shown in SEQ ID 
NO:2 by back translating, or 

5 iii) a functional equivalent of nucleic acid sequence shown in SEQ ID NO:1 

which has an identity with SEQ ID NO:1 of has at least 50%; or 

iv) a functional equivalent of the nucleic acid sequence shown in SEQ ID 
NO:1, which is encoded by an amino acid sequence that has at least an 
10 identity of 50% with the SEQ ID NO:2; 

b) the ClpP2-protease encoded by a nucleic add sequence which comprises: 

i) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 
15 NO:3, or 

ii) a nucleic acid sequence which, owing to the degeneracy of the genetic 
code, can be deduced from the amino acid sequence shown in SEQ ID 
NO:4 by back translating, or 



20 



iii) a functional equivalent of nucleic acid sequence shown in SEQ ID NO:3 
which has an identity with SEQ ID NO:3 of has at least 50%; or 



iv) a functional equivalent of the nucleic acid sequence shown in SEQ ID 
25 NO:3, which is encoded by an amino acid sequence that has at least an 

identity of 50% with the SEQ ID NO:4; 

c) the ClpP3-protease is encoded by a nucleic acid sequence which comprises: 

30 i) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 

NO:5, or 

ii) a nucleic acid sequence which, owing to the degeneracy of the genetic 
code, can be deduced from the amino acid sequence shown in SEQ ID 

35 NO:6 by back translating, or 

iii) a functional equivalent of nucleic acid sequence shown in SEQ ID NO:5 
which has an identity with SEQ ID NO:5 of has at least 50%;or 



40 



iv) 



a functional equivalent of the nucleic acid sequence shown in SEQ ID 
NO:5, which is encoded by an amino acid sequence that has at least an 
identity of 50% with the SEQ ID NO:6; 
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10 



35 



d) the ClpP4-protease is encoded by a nucleic acid sequence which comprises: 

i) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 
NO:7, or 

ii) a nucleic acid sequence which, owing to the degeneracy of the genetic 
code, can be deduced from the amino acid sequence shown in SEQ ID 
NO:8 by back translating, or 

iii) a functional equivalent of nucleic acid sequence shown in SEQ ID NO:7 
which has an identity with SEQ ID NO:7 of has at least 50%; or 



iv) a functional equivalent of the nucleic acid sequence shown in SEQ ID 
15 NO:7, which is encoded by an amino acid sequence that has at least an 

identity of 50% with the SEQ ID NO:8; 

e) the ClpP6-protease is encoded by a nucleic acid sequence which comprises: 

20 i) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 

NO:9, or 

ii) a nucleic acid sequence which, owing to the degeneracy of the genetic 
code, can be deduced from the amino acid sequence shown in SEQ ID 

25 NO: 10 by back translating, or 

iii) a functional equivalent of nucleic acid sequence shown in SEQ ID NO; 9 
which has an identity with SEQ ID NO:9 of has at least 50%; or 

30 iv) a functional equivalent of the nucleic acid sequence shown in SEQ ID 

NO:9, which is encoded by an amino acid sequence that has at least an 
identity of 50% with the SEQ ID NO:1 0; 



f) the ClpR1 -protease is encoded by a nucleic acid sequence which comprises: 

i) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 
NO:11,or 



ii) a nucleic acid sequence which, owing to the degeneracy of the genetic 
40 code, can be deduced from the amino acid sequence shown in SEQ ID 

NO: 12 by back translating, or 
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iii) a functional equivalent of nucleic acid sequence shown in SEQ ID NO:1 1 
which has an identity with SEQ ID NO:1 1 of has at least 50%; or 

iv) a functional equivalent of the nucleic add sequence shown in SEQ ID 

5 NO:1 1, which is encoded by an amino acid sequence that has at least an 

identity of 50% with the SEQ ID NO:12; 

g) the ClpR3-protease is encoded by a nucleic acid sequence which comprises: 

10 i) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 

NO:13, or 

ii) a nucleic acid sequence which, owing to the degeneracy of the genetic 
code, can be deduced from the amino acid sequence shown in SEQ ID 

15 NO: 14 by back translating, or 

iii) a functional equivalent of nucleic acid sequence shown in SEQ ID NO: 13 
which has an identity with SEQ ID NO:13 of has at least 50%; or 

20 iv) a functional equivalent of the nucleic acid sequence shown in SEQ ID 

NO: 13, which is encoded by an amino acid sequence that has at least an 
identity of 50% with the SEQ ID NO: 14; 



25 



35 



h) the ClpR4-protease is encoded by a nucleic acid sequence which comprises: 

i) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 
NO: 15, or 



ii) a nucleic acid sequence which, owing to the degeneracy of the genetic 
30 code, can be deduced from the amino acid sequence shown in SEQ ID 

NO: 16 by back translating, or 



iii) a functional equivalent of nucleic acid sequence shown in SEQ ID NO: 15 
which has an identity with SEQ ID NO:15 of has at least 50%; or 

iv) a functional equivalent of the nucleic acid sequence shown in SEQ ID 
NO:15, which is encoded by an amino acid sequence that has at least an 
identity of 50% with the SEQ ID NO:16; 



40 



i) 



the CIpP like-protease is encoded by a nucleic acid sequence which comprises: 
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a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 
NO: 17, or 

a nucleic acid sequence which, owing to the degeneracy of the genetic 
code, can be deduced from the amino acid sequence shown in SEQ ID 
NO: 18 by back translating, or 

a functional equivalent of nucleic acid sequence shown in SEQ ID NO:17 
which has an identity with SEQ ID NO: 17 of has at least 50%; or 

a functional equivalent of the nucleic acid sequence shown in SEQ ID 
NO: 17, which is encoded by an amino acid sequence that has at least an 
identity of 50% with the SEQ ID NO: 18; 

15 wherein the sequences b) i-iv, e) i-iv, f) i-iv and are especially preferred 

A polypeptide, which has the activity of nuclear encoded Clp-protease and is encoded 
by a nucleic acid sequence according to the invention are hereinbelow simply referred 
to as U CLP M . 

20 

Reduced amounts of glyoxysomal CLP cause growth retardation and necrotic and 
chlorotic leaves in plants. 

The gene products of the nucleic acids according to the invention constitute novel tar- 
25 gets for herbicides, which make possible the provision of novel herbicides for control- 
ling undesired plants. Moreover, the gene products of the nucleic acids according to 
the invention constitute novel targets for growth regulators which make possible the 
provision of novel growth regulators for regulating the growth of plants. 

30 Undesired plants are understood as meaning, in the broadest sense, all those plants 
which grow at locations where they are undesired, for example: 

Dicotyledonous weeds of the genera: Sinapis, Lepidium, Galium, Stellaria, Matricaria, 
Anthemis, Galinsoga, Chenopodium, Urtica, Senecio, Amaranthus, Portulaca, Xan- 
35 thium, Convolvulus, Ipomoea, Polygonum, Sesbania, Ambrosia, Cirsium, Carduus, 
Sonchus, Soianum, Rorippa, Rotala, Lindernia, Lamium, Veronica, Abutilon, Emex, 
Datura, Viola, Galeopsis, Papaver, Centaurea, Trifolium, Ranunculus, Taraxacum. 

Monocotyledonous weeds from the genera: Echinochloa, Setaria, Panicum, Digitaria, 
40 Phleum, Poa, Festuca, Eleusine, Brachiaria, Lolium, Bromus, Avena, Cyperus, Sor- 
ghum, Agropyron, Cynodon, Monochoria, Fimbristylis, Sagittaria, Eleocharis, Scirpus, 
Paspalum, Ischaemum, Sphenoclea, Dactyloctenium, Agrostis, Alopecurus, Apera. 
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SEQ ID NO: 1; 3, 5, 7, 9, 11, 13, 15, 17 or 19-21 or parts of SEQ ID NO: 1; 3, 5, 7, 9, 
1 1, 13, 15, 17 or 19-21 can be used for the preparation of hybridization probes. The 
preparation of these probes and the experimental procedure is known. For example, 
5 this can be effected via the selective preparation of radioactive or nonradioactive 

probes by PCR and the use of suitably labeled oligonucleotides, followed by hybridiza- 
tion experiments. The technologies required for this purpose are detailed, for example, 
in T. Maniatis, E.F. Fritsch and J. Sambrook, "Molecular Cloning: A Laboratory Man- 
ual 0 , Cold Spring Harbor Laboratory, Cold Spring Harbor, NY (1989). The probes in 
10 question can furthermore be modified by standard technologies (Lit. SDM or random 
mutagenesis) in such a way that they can be employed for further purposes, for exam- 
ple as a probe which hybridizes specifically with mRNA and the corresponding coding 
sequences in order to analyze the corresponding sequences in other organisms. 

15 The abovementioned probes can be used for the detection and isolation of functional 
equivalents of SEQ ID NO:2, 4, 6, 8, 10, 12, 14, 16 or 18 from other plant species on 
the basis of sequence identities. In this context, part or all of the sequence of the SEQ 
ID NO:2 in question is used as a probe for screening a genomic or cDNA library of the 
plant species in question or in a computer search for sequences of functional equiva- 

20 lents in electronic databases. 

Preferred plant species are the undesired plants which have already been mentioned 
at the outset. 

25 The invention furthermore relates to expression cassettes comprising 

a) genetic control sequences in operable linkage with a NCLP sequence; or 

b) additional functional elements, or 

30 

c) a combination of a) and b); 

and to the use of expression cassettes comprising 

35 a) genetic control sequences in operable linkage with a nucleic acid sequence ac- 
cording to the invention, 

b) additional functional elements, or 

40 c) a combination of a) and b); 
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for expressing a CLP, which can be used in in vitro assay systems. Both embodiments 
of the above-described expression cassettes are referred in the following text as ex- 
pression cassette according to the invention. 



5 In a preferred embodiment, an expression cassette according to the invention com- 
prises a promoter at the 5' end of the coding sequence and, at the 3' end, a transcrip- 
tion termination signal and, if appropriate, further genetic control sequences which are 
linked operably with the interposed nucleic acid sequence according to the invention. 

10 The expression cassettes according to the invention are also understood as meaning 
analogs which can be brought about, for example, by a combination of the individual 
nucleic acid sequences on a polynucleotide (multiple constructs), on a plurality of 
polynucleotides in a cell (cotransformation) or by sequential transformation. 

1 5 Advantageous genetic control sequences under point a) for the expression cassettes 
according to the invention or for vectors comprising expression cassettes according to 
the invention are, for example, promoters such as the cos, tac, trp, tet, Ipp, lac, laclq, 
T7, T5, T3, gal, trc, ara, SP6, d-PR or the D-PL promoter, all of which can be used for 
expressing a CLP, in Gram-negative bacterial strains. 

20 

Examples of further advantageous genetic control sequences are present, for example, 
in the promoters amy and SP02, both of which can be used for expressing a CLP, in 
Gram-positive bacterial strains, and in the yeast or fungal promoters AUG1, GPD-1, 
PX6, TEF, CUP1, PGK, GAP1, TPI, PHOS, AOX1, GAL10/CYC1, CYC1, OHC, ADH, 

25 TDH, Kex2, MFA or NMT or combinations of the abovementioned promoters (Degryse 
et al., Yeast 1995 June 15; 11(7):629-40; Romanos et al. Yeast 1992 June;8(6):423- 
88; Benito et al. Eur. J. Plant Pathol. 104, 207-220 (1998); Gregg et al. Biotechnology 
(N Y) 1993 Aug;11(8):905-10; Luo X., Gene 1995 Sep 22; 163(1): 127-31: Nacken et al M 
Gene 1996 Oct 10;175(1-2): 253-60; Turgeon et al., Mol Cell Biol 1987 Sep;7(9):3297- 

30 305) or the transcription terminators NMT, Gcy1 , TrpC, AOX1 , nos, PGK or CYC1 (De- 
gryse et al., Yeast 1995 June 15; 1 1(7):629-40; Brunelli et al. Yeast 1993 (Dec9(12): 
1309-18; Frisch et al., Plant Mol. Biol. 27(2), 405-409 (1995); Scorer et al., Biotechnol- 
ogy (N.Y. 12 (2), 181-184 (1994), Genbank acc. number Z46232; Zhao et al. Genbank 
acc number : AF049064; Punt et al., (1987) Gene 56 (1), 117-124), all of which can be 

35 used for expressing CLP, in yeast strains. 

Examples of genetic control sequences which are suitable for expression in insect cells 
are the polyhedrin promoter and the p10 promoter (Luckow, V.A. and Summers, M.D. 
(1988) Bio/Techn. 6, 47-55). 

40 

Advantageous genetic control sequences for expressing CLP, in cell culture, in addition 
to polyadenylation sequences such as, for example, from simian virus 40, are eu- 
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karyotic promoters of viral origin such as, for example, promoters of the polyoma virus, 
adenovirus 2, cytomegalovirus or simian virus 40. 

Further advantageous genetic control sequences for expressing nuclear encoded Clp 
5 Protease, in plants are present in the plant promoters CaMV/35S [Franck et a!., Cell 
21(1980) 285-294], PRP1 [Ward et al. f Plant. Mol. Biol. 22 (1993)], SSU, OCS, LEB4, 
USP, STLS1, B33, NOS; FBPaseP (WO 98/18940) or in the ubiquitin or phaseolin 
promoter; a promoter which is preferably used being, in particular, a plant promoter or 
a promoter derived from a plant virus. Especially preferred are promoters of viral origin 

10 such as the promoter of the cauliflower mosaic virus 35S transcript (Franck et al., Cell 
21 (1980), 285-294; Odell et al., Nature 313 (1985), 810-812). Further preferred consti- 
tutive promoters are, for example, the agrobacterium nopaline synthase promoter, the 
TR double promoter, the agrobacterium OCS (octopine synthase) promoter, the ubiq- 
uitin promoter, (Holtorf S et al., Plant Mol Biol 1995, 29:637-649), the promoters of the 

15 vacuolar ATPase subunits, or the promoter of a proline-rich wheat protein (WO 
91/13991). 

The expression cassettes may also comprise, as genetic control sequence, a chemi- 
cally inducible promoter, by which the expression of the exogenous gene in the plant 

20 can be controlled at a specific point in time. Such promoters, such as, for example, the 
PRP1 promoter (Ward et al., Plant. Mol. Biol. 22 (1993), 361-366), a salicylic-acid- 
inducible promoter (WO 95/19443), a benzenesulfonamide-inducible promoter (EP-A- 
0388186), a tetracyclin-inducible promoter (Gate et al., (1992) Plant J. 2, 397404), an 
abscisioacid-inducible promoter (EP-A 335528) or an ethanol- or cyclohexanone- 

25 inducible promoter (WO 93/21334) may also be used. 

Furthermore, suitable promoters are those which confer tissue- or organ-specific ex- 
pression in, for example, anthers, ovaries, flowers and floral organs, leaves, stomata, 
trichomes, stems, vascular tissues, roots and seeds. Others which are suitable in addi- 

30 tion to the abovementioned constitutive promoters are, in particular, those promoters 
which ensure leaf-specific expression. Promoters which must be mentioned are the 
potato cytosolic FBPase promoter (WO 97/05900), the rubisco (ribulose-1,5- 
bisphosphate carboxylase) SSU (small subunit) promoter or the ST-LSI promoter from 
potato (Stockhaus et al., EMBO J. 8 (1989), 2445 - 245). Promoters which are further- 

35 more preferred are those which control expression in seeds and plant embryos. Exam- 
ples of seed-specific promoters are the phaseolin promoter (US 5,504,200, Bustos MM 
et al., Plant Cell. 1989;1(9):839-53), the promoter of the 2S albumin gene (Joseffeon 
LG et al., J Biol Chem 1987, 262:12196-12201), the legumin promoter (Shirsat A et al., 
Mol Gen Genet, 1989;215(2):326-331), the USP (unknown seed protein) promoter 

40 (B§umlein H et al., Molecular & General Genetics 1991, 225(3):459-67), the napin 
gene promoter (Stalberg K, et al., L. Planta 1996, 199:515-519), the sucrose binding 



WO 2005/054283 



PCT/EP2004/013555 



27 

protein promoter (WO 00/26388) or the LeB4 promoter (Baumlein H et al., Mol Gen 
Genet 1991, 225: 121-128; Fiedler, U. et al., Biotechnology (NY) (1995), 13 (10) 1090). 

Further promoters which are suitable as genetic control sequences are, for example, 
5 specific promoters for tubers, storage roots or roots, such as, for example, the class I 
patatin promoter (B33), the potato cathepsin D inhibitor promoter, the starch synthase 
(GBSS1) promoter or the sporamin promoter, fruit-specific promoters such as, for ex- 
ample, the fruit-specific promoter from tomato (EP-A 409625), fruit-maturation-specific 
promoters such as, for example, the fruit-maturation-specific promoter from tomato 

10 (WO 94/21 794), inflorescence-specific promoters such as, for example, the phytoene 
synthase promoter (WO 92/16635) or the promoter of the P-rr gene (WO 98/22593), or 
plastid- or chromoplast-specific promoters such as, for example, the RNA polymerase 
promoter (WO 97/06250), or else the Glycine max phosphoribosyl-pyrophosphate ami- 
dotransferase promoter (see also Genbank Accession No. U87999), or another node- 

15 specific promoter as described in EP-A 249676. 

Additional functional elements b) are understood as meaning, by way of example but 
not by limitation, reporter genes, replication origins, selection markers and what are 
known as affinity tags, in fusion with CLP, directly or by means of a linker optionally 
20 comprising a protease cleavage site. Further suitable additional functional elements are 
sequences which ensure that the product is targeted into the apoplasts, into plastids, 
the vacuole, the mitochondrion, the peroxisome, the endoplasmatic reticulum (ER) or, 
owing to the absence of such operative sequences, remains in the compartment where 
it is formed, the cytosol, (Kermode, Crit. Rev. Plant Sci. 15, 4 (1996), 285-423). 

25 

Also in accordance with the invention are vectors comprising at least one copy of the 
nucleic acid sequences according to the invention and/or the expression cassettes ac- 
cording to the invention. 

30 In addition to plasmids, vectors are furthermore also understood as meaning all of the 
other known vectors with which the skilled worker is familiar, such as, for example, 
phages, viruses such as SV40, CMV, baculovirus, adenovirus, transposons, IS ele- 
ments, phasmids, phagemids, cosmids or linear or circular DNA. These vectors can be 
replicated autonomously in the host organism or replicated chromosomally; chromo- 

35 somal replication is preferred. 

In a further embodiment of the vector, the nucleic acid construct according to the inven- 
tion can advantageously also be introduced into the organisms in the form of a linear 
DNA and integrated into the genome of the host organism via heterologous or homolo- 
40 gous recombination. This linear DNA may consist of a linearized plasmid or only of the 
nucleic acid construct as vector, or the nucleic acid sequences used. 
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Further prokaryotic or eukaryotic expression systems are mentioned in Chapters 16 
and 17 in Sambrook et al M "Molecular Cloning: A Laboratory Manual." 2nd ed., Cold 
Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 
NY, 1989. Further advantageous vectors are described in Hellens et al. (Trends in 
5 plant science, 5, 2000). 

The expression cassette according to the invention and vectors derived therefrom can 
be used for transforming bacteria, cyanobacteria, (for example of the genus Synecho- 
cystis, Anabaena, Calothrix, Scytonema, Oscillatoria, Plectonema and Nostoc), proteo- 
10 bacteria such as, for example, Magnetococcus sp. MC1 , yeasts, filamentous fungi and 
algae and eukaryoatic nonhuman cells (for example insect cells) with the aim of pro- 
ducing CLP, recombinantly, the generation of a suitable expression cassette depending 
on the organism in which the gene is to be expressed. 

15 Vectors comprising a NCLP sequence form part of the subject-matter of the present 
invention. 

In a further advantageous embodiment, the nucleic acid sequences according to the 
invention may also be introduced into an organism by themselves. 

20 

If, in addition to the nucleic acid sequences, further genes are to be introduced into the 
organism, they can all be introduced into the organism together in a single vector, or 
each individual gene can be introduced into the organism in each case in one vector, it 
being possible to introduce the different vectors simultaneously or in succession. 

25 

In this context, the introduction, into the organisms in question (transformation), of the 
nucleic acid(s) according to the invention, of the expression cassette or of the vector 
can be effected in principle by all methods with which the skilled worker is familiar. 

30 In the case of microorganisms, the skilled worker will find suitable methods in the text- 
books by Sambrook, J. et al. (1989) "Molecular cloning: A laboratory manual", Cold 
Spring Harbor Laboratory Press, von F.M. Ausubel et al. (1994) "Current protocols in 
molecular biology", John Wiley and Sons, by D.M. Glover et al., DNA Cloning Vol.1, 
(1995), IRL Press (ISBN 019-963476-9), by Kaiser et al. (1994) Methods in Yeast Ge- 

35 netics, Cold Spring Habor Laboratory Press or Guthrie et al. "Guide to Yeast Genetics 
and Molecular Biology", Methods in Enzymology, 1994, Academic Press. In the trans- 
formation of filamentous fungi, the methods of choice are firstly the generation of pro- 
toplasts and transformation with the aid of PEG (Wiebe et al. (1997) MycoL Res. 101 
(7): 971-877; Proctor etal. (1997) Microbiol. 143, 2538-2591), and secondly the trans- 

40 formation with the aid of Agrobacterium tumefaciens (de Groot et al. (1998) Nat. Bio- 
tech. 16, 839-842). 
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In the case of dicots, the methods which have been described for the transformation 
and regeneration of plants from plant tissues or plant cells can be exploited for tran- 
sient or stable transformation. Suitable methods are the biolistic method or the trans- 
formation of protoplasts (cf., for example, Willmitzer, L, 1993 Transgenic plants. In: 
5 Biotechnology, A Multi-Volume Comprehensive Treatise (H.J. Rehm, G. Reed, A. 
Puhler, P. Stadler, eds.), Vol. 2, 627-659, VCH Weinheim-New York-Basle- 
Cambridge), electroporation, the incubation of dry embryos in DNA-containing solution, 
microinjection and the agrobacterium-radiated gene transfer. The abovementioned 
methods are described, for example, in B. Jenes et al., Techniques for Gene Transfer, 
10 in: Transgenic Plants, Vol. 1 , Engineering and Utilization, edited by S.D. Kung and R. 
Wu, Academic Press (1993) 128-143 and in Potrykus, Annu. Rev. Plant Physiol. Plant 
MolecBiol. 42 (1991) 205-225). 

The transformation by means of agrobacteria, and the vectors to be used for the trans- 
1 5 formation, are known to the skilled worker and described extensively in the literature 
(Bevan et al., Nucl. Acids Res. 12 (1984) 8711. The intermediary vectors can be inte- 
grated into the agrobacterial Ti or Ri plasmid by means of homologous recombination 
owing to sequences which are homologous to sequences in the T-DNA. This plasmid 
additionally contains the vir region, which is required for the transfer of the T-DNA. In- 
20 termediary vectors are not capable of replication in agrobacteria. The intermediary vec- 
tor can be transferred to Agrobacterium tumefaciens by means of a helper plasmid 
(conjugation). Binary vectors are capable of replication both in E. coli and in agrobacte- 
ria. They contain a selection marker gene and a linker or polylinker which are framed 
by the right and left T-DNA border region. They can be transformed directly into the 
25 agrobacteria (Holsters et al. Mol. Gen. Genet. 163 (1978), 181-187), EP A 0 120 516; 
Hoekema, in: The Binary Plant Vector System Offeetdrukkerij Kanters B.V., Alblasser- 
dam (1985), Chapter V; Fraley et al., Crit. Rev. Plant. Sci., 4: 1-46 and An et al. EMBO 
J. 4 (1985), 277-287). 



30 The transformation of monocots by means of vectors based on agrobacterium has also 
been described (Chan et al., Plant Mol. Biol. 22(1993), 491-506; Hiei et al., Plant J. 6 
(1994) 271-282; Deng et al. Science in China 33 (1990), 28-34; Wilmink et al., Plant 
Cell Reports 11,(1992) 76-80; May et al. Biotechnology 13 (1995) 486-492; Conner and 
Domisse; Int. J. Plant Sci. 153 (1992) 550-555; Ritchie et al. Transgenic Res. (1993) 

35 252-265). Alternative systems for the transformation of monocots are the transforma- 
tion by means of biolistic approach (Wan and Lemaux; Plant Physiol. 104 (1994), 37- 
48; Vasil etal. Biotechnology 11 (1992), 667-674; Ritala et al., Plant Mol. Biol 24, 
(1994) 317-325; Spencer et al., Theor. Appl. Genet. 79 (1990), 625-631), protoplast 
transformation, the electroporation of partially permeabilized ceils, and the introduction 

40 of DNA by means of glass fibers. In particular the transformation of maize has been 
described repeatedly in the literature (cf., for example, WO 95/06128; EP 0513849 A1; 
EP 0465875 A1; EP 0292435 A1; Fromm et al., Biotechnology 8 (1990), 833-844; 
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Gordon-Kamm et al M Plant Cell 2 (1990), 603-618; Koziel et al., Biotechnology 
11(1993) 194-200; Moroc et al., Theor Applied Genetics 80 (190) 721-726). 

The successful transformation of other cereal species has also already been described 
5 for example in the case of barley (Wan and Lemaux, see above; Ritala et al., see 
above; wheat (Nehra et al., Plant J. 5(1994) 285-297). 

Agrobacteria which have been transformed with a vector according to the invention can 
likewise be used in a known manner for the transformation of plants, such as test 

10 plants like Arabidopsis or crop plants like cereals, maize, oats, rye/barley, wheat, soya, 
rice, cotton, sugarbeet, canola, sunflower, flax, hemp, potato, tobacco, tomato, carrot, 
capsicum, oilseed rape, tapioca, cassava, arrowroot, Tagetes, alfalfa, lettuce and the 
various tree, nut and grapevine species, for example by bathing scarified leaves or leaf 
segments in an agrobacterial solution and subsequently growing them in suitable me- 

15 dia. 

The genetically modified plant cells can be regenerated via all methods with which the 
skilled worker is familiar. Such methods can be found in the abovementioned publica- 
tions by S.D. Kung and R. Wu, Potrykus or HOfgen and Willmitzer. 

20 

The transgenic organisms generated by transformation with one of the above- 
described embodiments of an expression cassette comprising a nucleic acid sequence 
according to the invention or a vector comprising the abovementioned expression cas- 
sette, and the recombinant CLP, which can be obtained from the transgenic organism 
25 by means of expression, form part of the subject matter of the present invention. The 
use of transgenic organisms comprising an expression cassette according to the inven- 
tion, for example for providing recombinant protein, and/or the use of these organisms 
in in-vivo assay systems likewise form part of the subject matter of the present inven- 
tion. 

30 

Preferred organisms for the recombinant expression are not only bacteria, yeasts, 
mosses, algae and fungi, but also eukaryotic cell lines. 

Preferred mosses are Physcomitrella patens or other mosses described in Kryptoga- 
35 men [Cryptogamia], Vol.2, Moose, Fame [Mosses, Ferns], 1991, Springer Verlag (ISBN 
3540536515). 

Preferred within the bacteria are, for example, bacteria from the genus Escherichia, 
Erwinia, Flavobacterium, Alcaligenes or cyanobacteria, for example from the genus 
40 Synechocystis, Anabaena, Calothrix, Scytonema, Oscillatoria, Plectonema and Nostoc, 
especially preferably Synechocystis or Anabaena. 
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Preferred yeasts are Candida, Saccharomyces, Schizosaccheromyces, Hansenula or 
Pichia. 

Preferred fungi are Aspergillus, Trichoderma, Ashbya, Neurospora, Fusarium, Beauve- 
5 ria, Mortierella, Saprolegnia, Pythium, or other fungi described in Indian Chem Engr. 
Section B. Vol 37, No 1 ,2 (1 995). 

Preferred plants are selected in particular among monocotyledonous crop plants such 
as, for example, cereal species such as wheat, barley, sorghum or millet, rye, triticale, 

10 maize, rice or oats, and sugarcane. The transgenic plants according to the invention 
are, furthermore, in particular selected from among dicotyledonous crop plants such as, 
for example, Brassicaceae such as oilseed rape, cress, Arabidopsis, cabbages or ca- 
nola; Leguminosae such as soyabean, alfalfa, pea, beans or peanut, Solanaceae such 
as potato, tobacco, tomato, egg plant or capsicum; Asteraceae such as sunflower, 

15 Tagetes, lettuce or Calendula; Cucurbitaceae such as melon, pumpkin/squash or zuc- 
chini, or linseed, cotton, hemp, flax, red pepper, carrot, sugar beet, or various tree, nut 
and grapevine species. 

In principle, transgenic animals such as, for example, C. elegans, are also suitable as 
20 host organisms. 

Also preferred is the use of expression systems and vectors which are available to the 
public or commercially available. 

25 Those which must be mentioned for use in E. coli bacteria are the typical advanta- 
geous commercially available fusion and expression vectors pGEX [Pharmacia Biotech 
Inc; Smith, D.B. and Johnson, K.S. (1988) Gene 67:31-40], pMAL (New England Bio- 
labs, Beverly, MA) and pRIT5 (Pharmacia, Piscataway, NJ), which contains glutathione 
S transferase (GST), maltose binding protein or protein A, the pTrc vectors (Amann et 

30 al., (1988) Gene 69:301-315), u pKK233-2 M from CLONTECH, Palo Alto, CA and the 
"pET\ and the "pBAD" vector series from Stratagene, La Jolla and the TOPO-TA vec- 
tor series drom Invitrogen. 

Further advantageous vectors for use in yeast are pYepSed (Baldari, et al., (1987) 
35 Embo J. 6:229-234), pMFa (Kurjan and Herskowitz, (1982) Cell 30:933-943), pJRY88 
(Schultz et al., (1987) Gene 54:113-123), and pYES derivatives, pGAPZ derivatives, 
pPICZ derivatives, and the vectors of the "Pichia Expression Kit" (Invitrogen Corpora- 
tion, San Diego, CA). Vectors for use in filamentous fungi are described in: van den 
Hondel, C.A.M.J.J. & Punt, P.J. (1991) "Gene transfer systems and vector develop- 
40 ment for filamentous fungi, in: Applied Molecular Genetics of Fungi, J.F. Peberdy, et 
al., eds., p. 1-28, Cambridge University Press: Cambridge. 
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As an alternative, insect cell expression vectors may also be used advantageously, for 
example for expression in Sf9, Sf21 or Hi5 cells, which are infected via recombinant 
Baculoviruses. Examples of these are the vectors of the pAc series (Smith et al. (1983) 
Mol. Cell Biol. 3:2156-2165) and the pVL series (Lucklow and Summers (1989) Virol- 
5 ogy 170:31-39). Others which may be mentioned are the Baculovirus expression sys- 
tems "MaxBac 2.0 Kif and "Insect Select System" from Invitrogen, Carlsbad or 
"BacPAK Baculovirus Expression System" from CLONTECH, Palo Alto, CA. Insect 
cells are particularly suitable for overexpressing eukaryotic proteins since they effect 
posttranslational modifications of the proteins which are not possible in bacteria and 
10 yeasts. The skilled worker is familiar with the handling of cultured insect cells and with 
their infection for expressing proteins, which can be carried out analogously to known 
methods (Luckow and Summers, Bio/Tech. 6, 1988, pp.47-55; Glover and Hames 
(eds) in DNA Cloning 2, A practical Approach, Expression Systems/Second Edition, 
Oxford University Press, 1995, 205-244). 

15 

Plant cells or algal cells are others which can be used advantageously for expressing 
genes. Examples of plant expression vectors can be found as mentioned above in 
Becker, D., et al. (1992) "New plant binary vectors with selectable markers located 
proximal to the left border", Plant Mol. Biol. 20: 1195-1197 or in Bevan, M.W. (1984) 
20 "Binary Agrobacterium vectors for plant transformation", Nucl. Acid. Res. 12: 871 1- 
8721. 

Moreover, the nucleic acid sequences according to the invention can be expressed in 
mammalian cells. Examples of suitable expression vectors are pCDM8 and pMT2PC, 

25 which are mentioned in: Seed, B. (1987) Nature 329:840 or Kaufman et al. (1987) 

EMBO J. 6:187-195). Promoters preferably to be used in this context are of viral origin 
such as, for example, promoters of polyoma virus, adenovirus 2, cytomegalovirus or 
simian virus 40. Further prokaryotic and eukaryotic expression systems are mentioned 
in Chapter 16 and 17 in Sambrook et al., Molecular Cloning: A Laboratory Manual. 2nd 

30 ed., Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press,. Cold Spring 
Harbor, NY, 1989. Further advantageous vectors are described in Hellens et al. 
(Trends in plant science, 5, 2000). 

The transgenic organisms which comprise a NCLP sequence are claimed within the 
35 scope of the present invention. 

All of the above-described embodiments of the transgenic organisms, which comprise 
at least one nucleic acid sequence according to the invention come under the term 
"transgenic organism according to the invention". 

40 

The present invention furthermore relates to the use of CLP, in a method for identifying 
herbicidally active test compounds. 
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The method according to the invention for identifying herbicidally active compounds 
preferably comprises the following steps: 

i. bringing CLP into contact with one or more test compounds under conditions 
which permit the test compound(s) to bind to a nucleic acid sequence according 
to the invention or to CLP, and 

ii. detecting whether the test compound binds to the CLP of i), or 

iii. detecting whether the test compound reduces or blocks the enzymatic or biologi- 
cal activity of CLP of i), or 

iv. detecting whether the test compound reduces or blocks the transcription, transla- 
tion or expression of CLP of i). 

The detection in accordance with step (ii) of the above method can be effected using 
techniques which identify the interaction between the polypeptide and ligand. In this 
context, either the test compound or the enzyme can contain a detectable label such 
as, for example, a fluorescent label, a radioisotope, a chemiluminescent label or an 
enzyme label. Examples of enzyme labels are horseradish peroxidase, alkaline phos- 
phatase or luciferase. The subsequent detection depends on the label and is known to 
the skilled worker. 

25 In this context, five preferred embodiments which are also suitable for high-throughput 
methods (HTS) in connection with the present invention must be mentioned in particu- 
lar: 

1 - The average diffusion rate of a fluorescent molecule as a function of the mass 
30 can be determined in a small sample volume via fluorescence correlation spec- 

troscopy (FCS) (Proc. Natl. Acad. Sci. USA (1994) 11753-11575). FCS can be 
employed for determining protein/ligand interactions by measuring the change in 
the mass, or the changed diffusion rate which this entails, of a test compound 
when binding to CLP. A method according to the invention can be designed di- 
35 rectly for measuring the binding of a test compound labeled by a fluorescent 

molecule. As an alternative, the method according to the invention can be de- 
signed in such a way that a chemical reference compound which is labeled by a 
fluorescent molecule is displaced by further test compounds ("displacement as- 
say"). 

40 

2. Fluoresence polarization exploits the characteristic of a quiescent fluorophore 
excited with polarized light to likewise emit polarized light. If, however, the fluoro- 
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phore is allowed to rotate during the excited state, the polarization of the fluores- 
cent light which is emitted is more or less lost Under otherwise identical condi- 
tions (for example temperature, viscosity, solvent), the rotation is a function of 
molecule size, whereby findings regarding the size of the fluorophore-bound resi- 
5 due can be obtained via the reading (Methods in Enzymology 246 (1995), pp. 

283-300). A method according to the invention can be designed directly for 
measuring the binding of a test compound labeled with a fluorescent molecule to 
the CLP. As an alternative, the method according to the invention may also take 
the form of the "displacement assay 0 described under 1. 

10 

3. Fluorescence resonance energy transfer (FRET) is based on the irradiation-free 
energy transfer between two spatially adjacent fluorescent molecules under suit- 
able conditions. A prerequisite is that the emission spectrum of the donor mole- 
cule overlaps with the excitation spectrum of the acceptor molecule. The fluores- 

15 cent label of CLP, and binding test compound, the binding can be measured by 

means of FRET (Cytometry 34, 1998, pp. 159-179). As an alternative, the 
method according to the invention may also take the form of the "displacement 
assay" described under 1 . An especially suitable embodiment of FRET technol- 
ogy is "Homogeneous Time Resolved Fluorescence" (HTRF) as can be obtained 

20 from Packard Bioscience. 

Surface-enhanced laser desorption/ionization (SELDI) in combination with a time- 
of-flight mass spectrometer (MALDI-TOF) makes possible the rapid analysis of 
molecules on a support and can be used for analyzing protein/ligand interactions 
(Worral et al., (1998) Anal. Biochem. 70:750-756). In a preferred embodiment, 
CLP, is immobilized on a suitable support and incubated with the test compound. 
After one or more suitable wash steps, the test compound molecules which are 
additionally bound to CLP, can be detected by means of the abovementioned 
methodology and test compounds which are bound to CLP, can thus be selected. 

The measurement of surface plasmon resonance is based on the change in the 
refractive index at a surface when a test compound binds to a protein which is 
immobilized to said surface. Since the change in the refractive index is identical 
for virtually all proteins and polypeptides for a defined change in the mass con- 
centration at the surface, this method can be applied to any protein in principle 
(Lindberg et al. Sensor Actuators 4 (1983) 299-304; Malmquist Nature 361 
(1993) 186-187). The measurement can be carried out for example with the 
automatic analyzer based on surface plasmon resonance which is available from 
Biacore (Freiburg) at a throughput of, currently, up to 384 samples per day. A 
method according to the invention can be designed directly for measuring the 
binding of a test compound to CLP. As an alternative, the method according to 
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the invention may also take the form of the "displacement assay" described under 

1. 

The compounds identified via the abovementioned methods 1 to 5 may be suitable as 
5 inhibitors. All of the substances identified via the abovementioned methods can subse- 
quently be checked for their herbicidal action in another embodiment of the method 
according to the invention. 

Furthermore, there exists the possibility of detecting further candidates for herbicidal 
active ingredients by molecular modeling via elucidation of the three-dimensional struc- 
ture of CLP, by x-ray structure analysis. The preparation of protein crystals required for 
x-ray structure analysis, and the relevant measurements and subsequent evaluations 
of these measurements, the detection of a binding site in the protein, and the prediction 
of potential inhibitor structures are known to the skilled worker. In principle, an optimi- 
zation of the compound identified by the abovementioned methods is also possible via 
molecular modeling. 

A preferred embodiment of the method according to the invention, which is based on 
steps i) and ii), consists in selecting a test compound which reduces or blocks the activ- 
ity of the CLP. Preferably, the activity of the CLP, incubated with the test compound is 
herein compared with the activity of a CLP, not incubated with a test compound. 

A more preferred embodiment of the method based on steps i) and ii) consists in 

i. expressing CLP in a transgenic organism according to the invention or growing 
an organism which naturally contains a CLP, 

ii. bringing CLP, of step i) in the cell digest of the transgenic or nontransgenic or- 
ganism, in partially purified or in homogeneously purified form, into contact with a 
test compound; and 

iii. selecting a compound which reduces or blocks the activity of the nuclear en- 
coded Clp Protease. Preferably the activity of CLP incubated with the test com- 
pound is herein compared with the activity of a CLP, not incubated with a test 

35 compound. 

The solution containing the CLP, can consist of the lysate of the original organism or of 
the transgenic organism which has been transformed with an expression cassette ac- 
cording to the invention. If necessary, the CLP, can be purified partially or fully via cus- 
40 tomary methods. A general overview over current protein purification techniques is de- 
scribed, for example, in Ausubel, F.M. et al., Current Protocols in Molecular Biology, 
Greene Publishing Assoc. and Wiley-lnterscience (1994); ISBN 0-87969-309-6. In the 
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case of recombinant preparation, the protein which has been fused with an affinity tag 
can be purified via affinity chromatography as is known to the skilled worker. 

The CLP, which is required for in vitro methods can thus be isolated either by means of 
5 heterologous expression from a transgenic organism according to the invention or from 
an organism containing CLP, for example from an undesired plant, the term "undesired 
planf being understood as meaning the species mentioned at the outset. 

To identify herbicidal compounds, the CLP, is now incubated with a test compound. 

1 0 After a reaction time, the enzymatic activity of the CLP, incubated with the test com- 
pound is determined in comparison with a CLP, not incubated with a test compound. If 
the CLP, is inhibited, a significant decrease in activity in comparison with the activity of 
the noninhibited polypeptide according to the invention is observed, the result being a 
reduction of at least 10%, advantageously at least 20%, preferably at least 30%, espe- 

15 daily preferably by at least 50%, up to 100% reduction (blocking). Preferred is an inhi- 
bition of at least 50% at test compound concentrations of 10^M, preferably at lO^M, 
especially preferably of lO^M, based on enzyme concentration in the micromolar 
range. 

20 The enzymatic activity of CLP, can be determined for example by an activity assay in 
which the increase of the product, the decrease of the substrate (or starting material) or 
the decrease or increase of the cofactor are determined, or by a combination of at least 
two of the abovementioned parameters, as a function of a defined period of time. 

25 Examples of suitable substrates are, for example small peptides and modified small 

peptides like peptides coupled to a fluorogenic molecule such as aminomethylcoumarin 
and succinylated peptides. 

If appropriate, derivatives of the abovementioned compounds which contain a detect- 
30 able label such as, for example, a fluorescent label (e.g. fluorogenic substrates such as 
N-Suc-Leu-Tyr-(7-amino-4-methylcoumarine) (SLT-AMC), Suc-Ala-Ala-Ala-AMC, Suc- 
Leu-Leu-Val-Tyr-AMC, Suc-Ala-Ala-Phe-AMC, Suc-lle-lle-Trp-AMC, Suc-AIa-Phe-Lys- 
AMC), a radioisotope label or a chemiluminescent label, may also be used. 

35 The amounts of substrates to be employed in the activity tests may range between 0.5 
and 100 mM, based on 1-100 pg/ml enzyme. 
[ 

The activity can be determined for example by tracking Proteolysis fluorimetrically 
when using fluorogenic Peptide substrates analogously to the method described by 
40 Woo et al. 1 989 The Journal of Biological Chemistry 264, pp.2088-2091 , which is 
herein incorporated by reference. 
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The activity may also be determined in an ATP-dependent fashion in the presence of 
ClpA, CIpB or CIpC Protein as described in Halperin et al. 2001, Planta 213, pp. 614- 
619. The preferred Substrate is then b-casein. 

5 Furthermore the activity may be measured by HPLC and HPLC-MS mehtods detecting 
fragments of the peptides used as substrates. 

Another preferred embodiment of the method according to the invention which is based 
on steps i) and iii) consists of the following steps: 

10 

i. generating a transgenic organism according to the invention comprising a nucleic 
acid sequence according to the invention, wherein CLP is expressed recombi- 
nantly; 

1 5 ii. applying a test compound to the transgenic organism of i) and to a nontransgenic 
organism of the same species; 

iii. determining the growth or the viability of the transgenic and the nontransgenic 
organisms after application of the test substance, and 

20 

iv. selecting test compounds which bring about a reduced growth or a limited viabil- 
ity of the nontransgenic organism in comparison with the growth of the transgenic 
organism. 

25 In this context, the difference in growth in step iv) for the selection of a herbicidally ac- 
tive inhibitor amounts to at least 10%, by preference 20%, preferably 30%, especially 
preferably 40% and very especially preferably 50%. 

The transgenic organism in this context is preferably a plant, an alga, a cyanobacte- 
30 rium, for example of the genus Synechocystis or a proteobacterium such as, for exam- 
pie, Magnetococcus sp. MC1, preferably plants which can be transformed by means of 
customary techniques, such as Arabidopsis thaliana Allium cepa, Ananas comosus, 
Arachis hypogaea, Asparagus officinalis, Beta vulgaris spec, altissima, Beta vulgaris 
spec, rapa, Brassica napus var. napus, Brassica napus var. napobrassica, Brassica 
35 rapa var. silvestris, Camellia sinensis, Carthamus tinctorius, Carya illinoinensis, Citrus 
limon, Citrus sinensis, Coffea arabica (Coffea canephora, Coffea liberica), Cucumis 
sativus, Cynodon dactylon, Daucus carota, Elaeis guineensis, Fragaria vesca, Glycine 
max, Gossypium hirsutum, (Gossypium arboreum, Gossypium herbaceum, Gossypium 
vitifolium), Helianthus annuus, Hevea brasiliensis, Hordeum vulgare, Humulus lupulus, 
40 Ipomoea batatas, Juglans regia, Lens culinaris, Linum usitatissimum, Lycopersicon 

lycopersicum, Malus spec, Manihot esculenta, Medicago sativa, Musa spec, Nicotiana 
tabacum (N.rustica), Olea europaea, Oryza sativa, Phaseolus lunatus, Phaseolus vul- 
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garis, Picea abies, Pinus spec, Pisum sativum, Prunus avium, Pnjnus persica, Pyrus 
communis, Ribes sylvestre, Ricinus communis, Saccharum officinarum, Secale ce- 
reals, Solanum tuberosum, Sorghum bicolor (s. vulgare), Theobroma cacao, Trifolium 
pratense, Triticum aestivum, Triticum durum, Vicia faba, Vitis vinifera, Zea mays, or 
5 cyanobacteria which can be transformed readily, such as Synechocystis, into which the 
sequence encoding a polypeptide according to the invention has been incorporated by 
transformation. These transgenic organisms thus show increased tolerance to com- 
pounds which inhibit the polypeptide according to the invention. "Knock-ouf mutants in 
which the analogous CLProtease gene which is naturally present in this organism has 
10 been selectively switched off may also be used. 

However, the abovementioned embodiment of the method according to the invention 
can also be used for identifying substances with a growth-regulatory action. In this con- 
text, the transgenic organism employed is a plant. The method for identifying sub- 
15 stances with growth-regulatory activity thus comprises the following steps: 

i. generating a transgenic plant comprising a nucleic acid sequence according to 
the invention encoding CLP, wherein CLP is expressed recombinantly; 

20 ii. applying a test substance to the transgenic plant of i) and to a nontransgenic 
plant of the same variety, 

iii. determining the growth or the viability of the transgenic plant and the nontrans- 
genic plant after application of the test compound, and 

25 

iv. selecting test substances which bring about a reduced growth of the nontrans- 
genic plant in comparison with the growth of the transgenic plant. 

Here, step iv) involves the selection of test compounds which bring about a modified 
30 growth of the nontransgenic organism in comparison with the growth of the transgenic 
organism. Modified growth is understood as meaning, in this context, inhibition of the 
vegetative growth of the plants, which can manifest itself in particular in reduced longi- 
tudinal growth. Accordingly, the treated plants show stunted growth; moreover, their 
leaves are darker in color. In addition, modified growth is also understood as meaning 
35 a change in the course of maturation over time, the inhibition or promotion of lateral 

branched growth of the plants, shortened or extended developmental stages, increased 
standing ability, the growth of larger amounts of buds, flowers, leaves, fruits, seed ker- 
nels, roots and tubers, an increased sugar content in plants such as sugarbeet, sugar 
cane and citrus fruit, an increased protein content in plants such as cereals or soybean, 
40 or stimulation of the latex flow in rubber trees. The skilled worker is familiar with the 
detection of such modified growth. 



WO 2005/054283 



PCT/EP2004/013555 



39 

It is also possible, in the method according to the invention, to employ a plurality of test 
compounds in a method according to the invention. If a group of test compounds affect 
the target, then it is either possible directly to isolate the individual test compounds or 
to divide the group of test compounds into a variety of subgroups, for example when it 
5 consists of a multiplicity of different components, in order to thus reduce the number of 
the different test compounds in the method according to the invention. The method 
according to the invention is then repeated with the individual test compound or the 
relevant subgroup of test compounds. Depending on the complexity of the sample, the 
above-described steps can be carried out repeatedly, preferably until the subgroup 
10 identified in accordance with the method according to the invention only comprises a 
small number of test compounds, or indeed just one test compound. 

All of the above-described methods for identifying inhibitors with herbicidal or growth- 
regulatory activity are hereinbelow referred to as "methods according to the invention n 

15 

All of the compounds which have been identified via the methods according to the in- 
vention can subsequently be tested in vivo for their herbicidal and growth-regulatory 
activity. One possibility of testing the compounds for herbicidal action is to use duck- 
weed, Lemna minor, in microtiter plates. Parameters which can be measured are 
20 changes in the chlorophyll content and the photosynthesis rate. It is also possible to 
apply the compound directly to undesired plants, it being possible to identify the herbi- 
cidal action for example via restricted growth. 

The method according to the invention can advantageously also be carried out in high- 
25 throughput methods, known as HTS, which makes possible the simultaneous testing of 
a multiplicity of different compounds. 

The use of supports which contain one or more of the nucleic acid molecules according 
to the invention, one or more of the vectors containing the nucleic acid sequence ac- 
30 cording to the invention, one or more transgenic organisms containing at least one of 
the nucleic acid sequences according to the invention or one or more (poly)peptides 
encoded via the nucleic acid sequences according to the invention lends itself to carry- 
ing out HTS in practice. 

35 Supports which contain one or more of the NCLP sequences, one or more of the vec- 
tors comprising the NCLP sequences one or more transgenic organisms containing at 
least one NCLP sequences or one or more (polypeptides encoded by the NCLP se- 
quences are part of the present invention. 

40 The support used can be solid or liquid, but is preferably solid and especially preferably 
a microtiter plate. The abovementioned supports also form part of the subject matter of 
the present invention. In accordance with the most widely used technique, 96-well, 
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384-well and 1536-well microtiter plates which, as a rule, can comprise volumes of 200 
□I, are used. Besides the microtiter plates, the further components of an HTS system 
which match the corresponding microtiter plates, such as a large number of instru- 
ments, materials, automatic pipetting devices, robots, automated plate readers and 
5 plate washers, are commercially available. 



In addition to the HTS systems based on microtiter plates, what are known as "free- 
format assays" or assay systems where no physical barriers exist between the sam- 
ples, as described, for example, in Jayaickreme et al., Proc. Natl. Acad. Sci U.SA 19 
10 (1994) 161418; Chelsky, "Strategies for Screening Combinatorial Libraries", First An- 
nual Conference of The Society for Biomolecular Screening in Philadelphia, Pa. (Nov. 
710, 1995); Salmon et al„ Molecular Diversity 2 (1996), 5763 and US 5,976,813, may 
also be used. 

15 The invention furthermore relates to herbicidally active compounds identified by the 
methods according to the invention. These compounds are herein below referred to as 
"selected compounds". They have a molecular weight of less than 1000 g/mol, advan- 
tageously less than 500 g/mol, preferably less than 400 g/mol, especially preferably 
less than 300 g/mol. Herbicidally active compounds have a Ki value of less than 1 mM, 

20 preferably less than 1 pM, especially preferably less than 0.1 pM, very especially pref- 
erably less than 0.01 pM. 

Examples for herbicidally active compounds identified with the above mentioned HTS 
methods are the compounds of the formula: 




formula (I) 



formula (II) 
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formula (III). 



10 

The invention furthermore relates to compounds with growth-regulatory activity identi- 
fied by the methods according to the invention. These compounds too are hereinbelow 
referred to as "selected compounds". 

1 5 Naturally, the selected compounds can also be present in the form of their agriculturally 
useful salts. Agriculturally useful salts which are suitable are mainly the salts of those 
cations, or the acid addition salts of those acids, whose cations, or anions, do not ad- 
versely affect the herbicidal action of the herbicidally active compounds identified via 
the methods according to the invention. 

20 

If the selected compounds contain asymmetrically substituted D-carbon atoms, they 
may furthermore also be present in the form of racemates, enantiomer mixtures, pure 
enantiomers or, if they have chiral substituents, also in the form of diastereomer mix- 
tures. 

25 

The selected compounds can be chemically synthesized substances or substances 
produced by microbes and can be found, for example, in cell extracts of, for example, 
plants, animals or microorganisms. The reaction mixture can be a cell-free extract or 
comprise a cell or cell culture. Suitable methods are known to the skilled worker and 
30 are described generally for example in Alberts, Molecular Biology the cell, 3rd Edition 
(1994), for example chapter 17. The selected compounds may also originate from 
comprehensive substance libraries. 

Candidate test compounds can be expression libraries such as, for example, cDNA 
35 expression libraries, peptides, proteins, nucleic acids, antibodies, small organic sub- 
stances, hormones, PNAs or the like (Milner, Nature Medicin 1 (1995), 879-880; Hupp, 
Cell. 83 (1 995), 237-245; Gibbs, Cell. 79 (1 994), 1 93-1 98 and references cited 
therein). 

40 The selected compounds can be used for controlling undesired vegetation and/or as 
growth regulators. Herbicidal compositions comprising the selected compounds afford 
very good control of vegetation on noncrop areas. In crops such as wheat, rice, maize, 
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soybean and cotton, they act against broad-leaved weeds and grass weeds without 
inflicting any significant damage on the crop plants. This effect is observed in particular 
at low application rates. The selected compounds can be used for controlling the harm- 
ful plants which have already been mentioned above. 

5 

Depending on the application method in question, selected compounds, or herbicidal 
compositions comprising them, can advantageously also be employed in a further 
number of crop plants for eliminating undesired plants. Examples of suitable crops are: 

10 Allium cepa, Ananas comosus, Arachis hypogaea, Asparagus officinalis, Beta vulgaris 
spec, altissima, Beta vulgaris spec, rapa, Brassica napus var. napus, Brassica napus 
var. napobrassica, Brassica rapa var. silvestris, Camellia sinensis, Carthamus tincto- 
rius, Carya illinoinensis, Citrus limon, Citrus sinensis, Coffea arabica (Coffea can- 
ephora, Coffea liberica), Cucumis sativus, Cynodon dactylon, Daucus carota, Elaeis 

15 guineensis, Fragaria vesca, Glycine max, Gossypium hirsutum, (Gossypium arboreum, 
Gossypium herbaceum, Gossypium vitifolium), Helianthus annuus, Hevea brasiliensis, 
Hordeum vulgare, Humulus lupulus, Ipomoea batatas, Juglans regia, Lens culinaris, 
Linum usitatissimum, Lycopersicon lycopersicum, Malus spec, Manihot esculenta, 
Medicago sativa, Musa spec, Nicotiana tabacum (N.rustica), Olea europaea, Oryza 

20 sativa, Phaseolus lunatus, Phaseolus vulgaris, Picea abies, Pinus spec., Pisum sati- 
vum, Prunus avium, Prunus persica, Pyrus communis, Ribes sylvestre, Ricinus com- 
munis, Saccharum officinarum, Secale cereale, Solanum tuberosum, Sorghum bicolor 
(s. vulgare), Theobroma cacao, Trifolium pratense, Triticum aestivum, Triticum durum, 
Vicia faba, Vitis vinifera, Zea mays. 

25 

In addition, the selected compounds can also be used in crops which tolerate the ac- 
tion of herbicides owing to breeding, including recombinant methods. The generation of 
such crops is described hereinbelow. 

30 The invention furthermore relates to a method of preparing the herbicidal or growth- 
regulatory composition which has already been mentioned above, which comprises 
formulating selected compounds with suitable auxiliaries to give crop protection prod- 
ucts. 

35 The selected compounds can be formulated for example in the form of directly spray- 
able aqueous solutions, powders, suspensions, also highly concentrated aqueous, oily 
or other suspensions or suspoemulsions or dispersions, ernulsifiable concentrates, 
emulsions, oil dispersions, pastes, dusts, materials for spreading or granules, and ap- 
plied by means of spraying, atomizing, dusting, spreading or pouring. The use forms 

40 depend on the intended use and the nature of the selected compounds; in any case, 
they should guarantee the finest possible distribution of the selected compounds. The 
herbicidal compositions comprise a herbicidally active amount of at least one selected 



WO 2005/054283 PCT/EP2004/0 13555 

43 

compound and auxiliaries conventionally used in the formulation of herbicidal composi- 
tions. 

For the preparation of emulsions, pastes or aqueous or oily formulations and dispersi- 
5 bie concentrates (DC), the selected compounds can be dissolved or dispersed in an oil 
or solvent, it being possible to add further formulation auxiliaries for homogenization. 
However, it is also possible to prepare liquid or solid concentrates from selected com- 
pound, if appropriate solvents or oil and, optionally, further auxiliaries comprising liquid 
or solid concentrates, and these concentrates are suitable for dilution with water. The 

10 following can be mentioned: emulsifiable concentrates (EC, EW), suspensions (SC), 
soluble concentrates (SL), dispersible concentrates (DC), pastes, pills, wettable pow- 
ders or granules, it being possible for the solid formulations either to be soluble or dis- 
persible (wettable) in water. In addition, suitable powders or granules or tablets can 
additionally be provided with a solid coating which prevents abrasion or premature re- 

1 5 lease of the active ingredient. 

In principle, the term "auxiliaries 11 is understood as meaning the following classes of 
compounds: antifoams, thickeners, wetting agents, tackifiers, dispersants, emulsifiers, 
bactericides and/or thixotropic agents. The skilled worker is familiar with the meaning of 
20 the abovementioned agents. 

SLs, EWs and ECs can be prepared by simply mixing the ingredients in question; pow- 
ders can be prepared by mixing or grinding in specific types of mills (for example ham- 
mer mills). DCs, SCs and SEs are usually prepared by wet milling, it being possible to 

25 prepare an SE from an SC by addition of an organic phase which may comprise further 
auxiliaries or selected compounds. The preparation is known. Powders, materials for 
spreading and dusts can advantageously be prepared by mixing or cogrinding the ac- 
tive substances together with a solid carrier. Granules, for example coated granules, 
impregnated granules and homogeneous granules, can be prepared by binding the 

30 selected compounds to solid carriers. The skilled worker is familiar with further details 
regarding their preparation, which are mentioned for example in the following publica- 
tions: US 3,060,084, EP-A 707445 (for liquid concentrates), Browning, "Agglomera- 
tion", Chemical Engineering, Dec. 4, 1967, 147-48, Perry's Chemical Engineer's Hand- 
book, 4th Ed., McGraw-Hill, New York, 1963, pages 8-57 and et seq. WO 91/13546, 

35 US 4,172,714, US 4,144,050, US 3,920,442, US 5,180,587, US 5,232,701, US 

5,208,030, GB 2,095,558, US 3,299,566, Klingman, Weed Control as a Science, John 
Wiley and Sons, Inc., New York, 1961, Hance et al., Weed Control Handbook, 8th Ed., 
Blackwell Scientific Publications, Oxford, 1989 and Mollet, H M Grubemann, A., Formu- 
lation technology, Wiley VCH Verlag GmbH, Weinheim (Federal Republic of Germany), 

40 2001. 
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The skilled worker is familiar with a multiplicity of inert liquid and/or solid carriers which 
are suitable for the formulations according to the invention, such as, for example, liquid 
additives such as mineral oil fractions of medium to high boiling point such as kerosene 
or diesel oil, furthermore coal tar oils and oils of vegetable or animal origin, aliphatic, 
5 cyclic and aromatic hydrocarbons, for example paraffin, tetrahydrophthalene, alkylated 
naphthalenes or their derivatives, alkylated benzenes or their derivatives, alcohols such 
as methanol, ethanol, propanol, butanol and cyclohexanol, ketones such as cyclohexa- 
none, or strongly polar solvents, for example amines such as N-methylpyrrolidone or 
water 

10 

Examples of solid carriers are mineral earths such as silicas, silica gels, silicates, talc, 
kaolin, limestone, lime, chalk, bole, loess, clay, dolomite, diatomaceous earth, calcium 
sulfate, magnesium sulfate, magnesium oxide, ground synthetic materials, fertilizers 
such as ammonium sulfate, ammonium phosphate, ammonium nitrate, ureas and 
15 products of vegetable origin such as cereal meal, tree bark meal, wood meal and nut- 
shell meal, cellulose powders or other solid earners. 

The skilled worker is familiar with the multiplicity of surface-active substances (surfac- 
tants) which are suitable for the formulations according to the invention such as, for 

20 example, alkali metal salts, alkaline earth metal salts or ammonium salts of aromatic 
sulfonic acids for example lignosulfonic acid, phenolsulfonic acid, naphthalenesulfonic 
acid, and dibutylnaphthalenesulfonic acid, and of fatty acids, of alkyl- and alkylarylsul- 
fonates, of alkyl sulfates, laury! ether sulfates and fatty alcohol sulfates, and salts of 
sulfated hexa-, hepta- and octadecanols and of fatty alcohol glycol ethers, condensates 

25 of sulfonated naphthalene and its derivatives with formaldehyde, condensates of naph- 
thalene or of the naphthalenesulfonic acids with phenol and formaldehyde, poly- 
oxyethylene octylphenol ether, ethoxylated isooctyl-, octyl- or nonylphenol, alkylphenyl 
poiyglycol ethers, tributylphenyl polyglycol ether, alkylaryl polyether alcohols, isotridecyl 
alcohol, fatty alcohol/ethylene oxide condensates, ethoxylated caster oil, polyoxyethyl- 

30 ene alkyl ethers or polyoxypropylene alkyl ethers, lauryl alcohol polyglycol ether ace- 
tate, sorbitol esters, lignosulfite waste liquors or methylcellulose. 

The herbicidal compositions, or the selected compounds, can be applied pre- or post- 
emergence. If the selected compounds are less well tolerated by certain crop plants, 
35 application techniques may be used in which the selected compounds are sprayed, 
with the aid of the spraying apparatus, in such a way that they come into as little con- 
tact, if any, with the leaves of the sensitive crop plants while the selected compounds 
reach the leaves of undesired plants which grow underneath, or the bare soil surface 
(post-directed, lay-by). 
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Depending on the intended purpose of the control measures, the season, the target 
plants and the growth stage, the application rates of selected compounds amount to 
0.001 to 3.0, preferably 0.01 to 1.0 kg/ha. 

5 The invention is illustrated in greater detail by the examples which follow, which are not 
to be considered as limiting. 

General DNA manipulation and cloning methods 

1 0 Cloning methods such as, for example, restriction cleavages, agarose gel electropho- 
reses, purification of DNA fragments, transfer of nucleic acids to nitrocellulose and ny- 
lon membranes, linking DNA fragments, transformation of Escherichia coli cells, grow- 
ing bacterium and sequence analyses of recombinant DNA were carried out as de- 
scribed by Sambrook et al. (1989) (Cold Spring Harbor Laboratory Press: ISBN 0- 

1 5 87969-309-6) and Ausubel, F.M. et al., Current Protocols in Molecular Biology, Greene 
Publishing Assoc. and Wiley-lnterscience (1994); ISBN 0-87969-309-6. 

Molecular-biological standard methods for plants and plant transformation methods are 
described in Schultz et al., Plant Molecular Biology Manual, Kluwer Academic Publish^ 
20 ere (1 998), Reither et al., Methods in Arabidopsis Research, World scientific press 
(1992) and Arabidopsis: A Laboratory Manual (2001), ISBN 0-87969-573-0. 

The bacterial strains used hereinbelow (E. coli DH5, XL-1 blue) were obtained from 
Stratagene, BRL Gibco or Invitrogen, Carlsberg, CA. The vectors used for cloning were 
25 pUC 19 from Amersham Pharmacia (Freiburg) and the vector pBinAR (Hdfgen and 
Willmitzer, Plant Science 66, 1990, 221-230). 

Example 1: Generation of a cDNA library in the plant transformation vector 

30 To generate a cDNA library (hereinbelow termed "binary cDNA library") in a vector 
which can be used directly for transforming plants, mRNA was isolated from a variety 
of plant tissues and transcribed into double-stranded cDNA using the cDNA Synthese 
Kit (Amersham Pharmacia Biotech, Freiburg). The cDNA first-strand synthesis was 
carried out using T12-18 oligonucleotides following the manufacturer's instructions. 

35 After size fractionation and the ligation of EcoRI-Notl adapters following the manufac- 
turer's instructions and filling up the overhangs with Pfu DNA polymerase (Stratagene), 
the cDNA population was normalized. The method of Kohci et al, 1995, Plant Journal 8, 
771-776 was followed, the cDNA being amplified by PCR with the oligonucleotide N1 
under the conditions given in Table 1. 
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Table 1 



Temperature [°C] 


Time [seel 


Number of cycles 


94 


300 


1 


94 


8 


10 


52 


60 




72 


180 




94 


8 


10 


50 


60 




72 


180 




94 


8 


10 


48 


60 




72 


180 




72 


420 


1 



5 The resulting PCR product was bound to the column matrix of the PCR purification kit 
(Qiagen, Hilden) and eluted with 300 mM NaP buffer, pH 7.0, 0.5 mM EDTA, 0.04% 
SDS. The DIMA was denatured for 5 minutes in a boiling water bath and subsequently 
renatured for 24 hours at 60oC. 50|jl of the DNA were applied to a hydroxylapatite col- 
umn and the column was washed 3 times with 1 ml of 10 mM NaP buffer, pH 6.8. The 
10 bound single-stranded DNA was eluted with 130 mM NaP buffer, pH 6.8, precipitated 
with ethanol and dissolved in 40 pi of water. 20 pi of growth were used for a further 
PCR amplification as described above. After further ssDNA concentration, a third PCR 
amplification was carried out as described above. 

15 The plant transformation vector for taking up the cDNA population which had been 
generated as described above was generated via restriction enzyme cleavage of the 
vector pUC18 with Sbfl and BamHI, purification of the vector fragment followed by fill- 
ing up the overhangs with Pfu DNA polymerase and relegation with T4 DNA ligase 
(Stratagene). The resulting construct is hereinbelow termed pUC18Sbfk 

20 

The vector pBinAR was first cleaved with Notl, the ends were filled up and the vector 
was relegated, cleaved with Sbfl, the ends were filled up and the vector was relegated 
and subsequently cleaved with EcoRI and Hindlll. The resulting fragment was ligated 
into a derivative of the binary plant transformation vector pPZP (Hajdukiewicz.P, Svab, 
25 Z, Maliga, P., (1994) Plant Mol Biol 25:989-994) which makes possible the transforma- 
tion of plants by means of agrobacterium and mediates kanamycin resistance in trans- 
genic plants. The construct generated thus is hereinbelow termed pSun12/35S. 
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pUC18Sbfl- was used as template in a polymerase chain reaction (PCR) with the oli- 
gonucleotides V1 and V2 (see Table 2) and Pfu DNA polymerase. The resulting frag- 
ment was ligated into the Smal-cut pSun12/35S, giving rise to pSunblues2. Following 
cleavage with Notl, dephosphorylation with shrimp alkaline phosphatase (Roche Diag- 
5 nostics, Mannheim) and purification of the vector fragment, pSunblues2 was ligated 
with the normalized, likewise Notl-cut cDNA population. Following transformation into 
E.coli XMblue (Stratagene), the resulting clones were deposited into microtiter plates. 
The binary cDNA library contains cDNAs in "sense"- and in "antisense" orientation un- 
der the control of the cauliflower mosaic virus 35S promoter, and, after transformation 
10 into tobacco plants, these cDNAs can, accordingly, lead to "cosuppression" and "an- 
tisense" effects. 



Table 2: Oligonucleotides used 



Oligonucleotide 


Nucleic acid sequence 


N1 


5'-AGAATTCGCGGCCGCT-3' (SEQ ID NO:23) 


V1 (PWL93not) 


5'-CTCATGCGGCCGCGCGCAACGCAATTAATGTG-3' (SEQ 
ID NO:24) 


V2 (pWL92) 


5'-TCATGCGGCCGCGAGATCCAGTTCGATGTAAC-3' (SEQ 
ID NO:25) 


G1 (35S) 


5'-GTGGATTGATGTGATATCTCC-3' (SEQ ID NO:26) j 


G2 (OCS) 


5-GTAAGGATCTGAGCTACACAT-3' (SEQ ID NO:27) j 



Example 2: Transformation and analysis of tobacco plants 

Selected clones of the binary cDNA library were transformed into Agrobacterium tume- 
20 faciens C58C1:pGV2260 and (Deblaere et al.. Nucl. Acids. Res. 13(1984), 4777-4788) 
and incubated with Streptomycin/Spectinomycin selection. The material used for the 
transformation of tobacco plants (Nicotiana tabacum cv. Samsun NN) with one of the 
binary clones as depicted in table 3 was an overnight culture of a positively trans- 
formed agrobacterial colony diluted with YEB medium to OD600 = 0.8-1.6. Leaf discs 
25 of sterile plants (approx. 1 cm2 each) were incubated for 5-10 minutes with a 1:50 
agrobacterial dilution in a Petri dish. This was followed by incubation in the dark for 2 
days at 25°C on Murashige-Skoog medium (Physiol. Plant. 15(1962), 473) supple- 
mented with 2% sucrose (2MS medium) and 0.8% Bacto agar. The cultivation was con- 
tinued after 2 days at a 16-hour-light/8-hour-darkness photoperiod and continued in a 
30 weekly rhythm on MS medium supplemented with 500mg/l Claforan (cefotaxime so- 
dium), 50mg/l kanamycin, 1mg/I benzylaminopurin (BAP), 0.2mg/l naphthylacetic acid 
and 1.6g/l glucose. Regenerated shoots were transferred onto an MS medium supple- 
mented with kanamycin and Claforan. Transgenic plants of lines as depicted in table 3 
were generated in this manner. 
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Table 3: Plant lines generated 



Partial cDNA with pheno- 
type in transgenic tobacco 


Plant line 


Corresponding 
full length cDNA 


Function 


SEQ ID NO: 20 


E_0000013511 


SEQ ID NO: 3 


ClpP2 protease 


SEQ ID NO:19 


E_0000008893 


SEQ ID NO: 11 


ClpP5=ClpR1 pro- 
tease 


SEQ ID NO:21 


E_0000012393 




ClpP6 protease 






SEQ ID NO:17 


ClpP-like protease 



The integration of the clone cDNA into the genome of the transgenic lines was detected 
5 via PCR with the oligonucleotides G1 and G2 (see Table 2) and genomic DNA pre- 
pared from the transgenic lines in question. To this end, TAKARA Taq DNA poly- 
merase was preferably employed for this purpose, following the manufacturer's instruc- 
tions (MoBiTec, Gottingen). The cDNA clone of the binary cDNA library, which clone 
had been used for the transformation, acted as template for a PCR reaction as the 
10 positive control. PCR products with an identical size or, if appropriate, identical cleav- 
age patterns which were obtained after cleavage with a variety of restriction enzymes 
acted as proof that the corresponding cDNA had been integrated. In this manner, the 
insert of clones were detected in the respective transgenic plant lines (as depicted in 
table 3) with the belowmentioned phenotypes. 

15 

After the shoots had been transferred into soil, the plants were observed for 2-20 
weeks in the greenhouse for the manifestation of phenotypes. It emerged that trans- 
genic plants of lines E_0000012393, E_000001351 1 and EJ3000008893 were similar 
in phenotype. The plants showed severe chlorosis and concomitant growth retardation 
20 with respect to wild type plants after 2 weeks. 

Example 3: Sequence analysis of the clones 

SEQ ID NO: 19 was fully sequenced and used for the detection of the corresponding full 
25 length clone SEQ ID NO:1 1 . SEQ ID NO:1 1 is identical to nt002050071 r SEQ ID 
NO:19 in the overlapping region. An open reading frame of 867 nt (pos. 2-1162) en- 
codes for 387 amino acids (SEQ ID NO: 4) with highest identity to-ClpR1 from Arabi- 
dopsis thaliana-. Sequence homology suggests that the S'-ends of SEQ ID NO:1 1 and 
ClpR1 from Arabidopsis thaliana are very diverse and that nt006066004r is close to 
30 being full size with respect to ClpR1 . MS-Analysis of isolated CIpP Proteins from Arabi- 
dopsis indicate, that the mature ClpR1 is several kDal shorter as suggested by the 
cDNA Sequence (Peltier et al. 2001, The Journal of Biological Chemistry 276, 
99.16318-16327). 



WO 2005/054283 



PCT/EP2004/013555 



49 

SEQ ID NO:20 was fully sequenced and used for the detection of the corresponding full 
length clone, SEQ ID NO: 3r SEQ ID NO:3 is identical to SEQ ID NO:20 in the overlap- 
ping region. An open reading frame of 867 nt (pos. 1 1-877) encodes for 289 amino 
acids (SEQ ID NO:4) with highest identity to ClpP2 from Arabidopsis thaliana. 

5 

SEQ ID NO:21 was fully sequenced . The partial cDNA Sequence of 602 nt contains an 
open reading frame of 186 nt (nt 8-193) encoding for 62 amico acids (SEQ ID NO:22). 
This partial polypeptide shows highest identity to ClpP6 from Arabidopsis thaliana 
(SEQ ID NO:9) 

10 

A further ClpP-homolog cDNA of 906 nt (SEQ ID NO: 17) was identified. An open read- 
ing frame of 71 1 nt (pos. 45-755) encodes for 237 amino acids (SEQ ID NO: 18) with 
highest identity to a ClpP-like protein from Arabidopsis thaliana (GeneBank Acc. No. 
AK1 18523). 

15 

Thus, it was shown for the first time and in a surprising manner that the natural expres- 
sion of nuclear encoded Cip protease encoding genes is essential for plants and that 
reduced expression leads to damage as depicted by the phenotypes mentioned in Ex- 
ample 2 demonstrating the suitability of nuclear encoded Clp-proteases as target for 
20 herbicides. 

Example 4: Expression in E.coli 

In order to generate active protein with nuclear encoded Clp-protease activity frag- 
25 ments of SEQ ID NO:1 1, -SEQ ID NO:3 and of SEQ ID NO:9 were subcloned into the 
expression vector pQE60 (Quiagen, Hilden, Germany). To this end the oligonucleotides 
displayed in tab. 4 where used to amplify via polymerase chain reaction cDNA frag- 
ments that contain Ncol and Bglll restriction sites. The PCR was carried out in 36 cy- 
cles following standard conditions (for example as described by Sambrook, J. et al. 
30 (1 989) "Molecular cloning: A laboratory manual", Cold Spring Harbor Laboratory 

Press), the annealing temperatures being between 45 and 55°C and the polymerization 
time being in each case 60 seconds per 1000bp. Cutting the cDNA fragments with Ncol 
and Bglll restriction enzymes and ligation into pQE60 cut with the same enzymes de- 
livered expression plasmids that were transformed into E. coli. Expression was per- 
35 formed in E. coli TOP 10F strains (Invitrogen, Karlsruhe, Germany) following induction 
with IPTG. Standard protocols (Invitrogen) were followed. 

Enzyme preperations were achieved by breaking cells in a French-Press in 100mM 
Tris/HCI, pH 7.4, 2.5 mM EDTA, 1% Triton X-100. 

40 

The expression products were purified by affinity chromatography on Ni-agarose where 
appropriate. The manufacturer's instructions were followed (Qiagen). 
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Table 4 



30 



Construct 


Primer (Nucleic acid sequence) 




S'-TATACCATGGATTTGCCATCTTTG-a' (SEQ ID NO:28) 
5'-ATAGATCTCACCTGGAGCCAG-3* (SEQ ID NO:29) 


Nt_ClpP2 1) 


5-GAGCCCATGGCAAGAGGAG -3' (SEQ ID NO:30) 
S'-ATAGATCTTTCTAGCTTGAACC-S' (SEQ ID NO:31) 




5-TCAGCCATGGCCCCTGGAGGAC -3'(SEQ ID NO:32) 
S'-TAAGATCTTCAGTATTCTGTTTCC-S' (SEQ ID NO:33) 



1) Template: Nicotiana tabacum cDNA library 
5 2) Template: Arabidopsis thaliana cDNA library 

Example 5: Activity assay 

Isolated CIpP activity can be measured as described (Woo et al. 1989 The Journal of 
10 Biological Chemistry 264, pp.2088-2091) by using fluoregenic substrates such as N- 
Suc-Leu-Tyr-(7-amino-4-methylcoumarine) (SLT-AMG). the proteolytic cleavage delib- 
erates 7-amino-4-methylcoumarin, which can be detected fluorimetrically (emission at 
460nm by exitation at 390 nm). 

Standard assays contain: 50mM Tris/HCI, pH 8,0, 25mM MgCI 2 , 1mM SLT-AMC and 1- 
15 100 pg CIpP Enzyme. 

The assay is suitable in for high throughput screening in 96well and 384 well format. 

Screening according to the above mentioned assay provided the following compounds 
20 of the formula: 



25 

or 




formula (I) 



ox 



A 



formula (II) 



or " ci 

35 
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10 




showing a inhibition of the enzyme of: 



formula (III). 



compound of formula 


IC50 


(I) 


2.3E-05 


(") 


1.9E-05 


(III) 


2.5E-05 



20 



25 



35 



Sequence Listing 






Sequence 


Function 


Organism 


SEQ ID NO:1 (nucleic acid sequence) 


ClpP1 


Arabidopsis thanliana 


SEQ ID NO:2 (amino acid sequence) 


ClpP1 


Arabidopsis thanliana 


SEQ ID NO:3 (nucleic acid sequence) 


ClpP2 


Nicotiana tabacuum 


SEQ ID NO:4 (amino acid sequence) 


ClpP2 


Nicotiana tabacuum 


SEQ ID NO:5 (nucleic acid sequence) 


ClpP3 


Arabidopsis thanliana 


SEQ ID NO:6 (amino acid sequence) 


ClpP3 


Arabidopsis thanliana 


SEQ ID NO:7 (nucleic acid sequence) 


ClpP4 


Arabidopsis thanliana 


SEQ ID NO:8 (amino acid sequence) 


ClpP4 


Arabidopsis thanliana 


SEQ ID NO:9 (nucleic acid sequence) 


ClpP6 


Arabidopsis thanliana 
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5 



SEQ ID NO: 10 (amino acid sequence) ClpP6 Arabidopsis thaniiana 

SEQ ID NO:11 (nucleic acid sequence) ClpR1 Nicotiana tabacuum 

SEQ ID NO:12 (amino acid sequence) ClpR1 Nicotiana tabacuum 

SEQ ID NO:13 (nucleic acid sequence) ClpR3 Arabidopsis thaniiana 

10 SEQ ID NO: 14 (amino acid sequence) ClpR3 Arabidopsis thaniiana 

SEQ ID NO: 15 (nucleic acid sequence) CIpR4 Arabidopsis thaniiana 



SEQ ID NO:16 (amino acid sequence) ClpR4 Arabidopsis thaniiana 

15 

SEQ ID NO:17 (nucleic acid sequence) CIpP like Arabidopsis thaniiana 

SEQ ID NO: 18 (amino acid sequence) CIpP like Arabidopsis thaniiana 

20 SEQ ID NO:19 (nucleic acid sequence) ClpR1 Nicotiana tabacuum 
(fragment) 

SEQ ID NO:20 (nucleic acid sequence) ClpP2 Nicotiana tabacuum 
(fragment) 

25 

SEQ ID NO:21 (nucleic acid sequence) ClpP6 Nicotiana tabacuum 
(fragment) 

SEQ ID NO:22 (amino acid sequence) ClpP6 Nicotiana tabacuum 
30 (fragment) 



SEQ ID NO:23-33: Primer (nucleic acid sequences) 
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We claim: 

1 . The use of nuclear encoded Clp-protease in a method for identifying herbicides. 

2. The use as claimed in claim 1, wherein the Clp-protease is 

a) selected from the group consisting of ClpP1-protease, ClpP2-protease, 
ClpP3-protease, ClpP4-protease and ClpP6-protease; or 

b) selected from the group consisting of ClpR1 -protease, ClpR3-protease, 
ClpR4-protease; or 

c) ClpP-like-protease. 

3. A plant nucleic acid sequence encoding a ClpP2-protease comprising: 

a) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 
NO:3, or 

b) a nucleic acid sequence which, owing to the degeneracy of the genetic 
code, can be deduced from the amino acid sequence shown in SEQ ID 
NO:4 by backtranslating, or 

c) a functional equivalent of nucleic acid sequence shown in SEQ ID NO:3 
which has an identity with SEQ ID NO:3 of has at least 66%. 

4. A plant nucleic acid sequence encoding a CIpR1 -protease comprising: 

a) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 
NO:11,or 

b) a nucleic acid sequence which, owing to the degeneracy of the genetic 
code, can be deduced from the amino acid sequence shown in SEQ ID 
NO: 12 by backtranslating, or 

c) a functional equivalent of nucleic acid sequence shown in SEQ ID NO:1 1 
which has an identity with SEQ ID NO:1 1 of has at least 69%. 

5. A plant nucleic acid sequence encoding a ClpP-like-protease comprising: 
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a) a nucleic add sequence with the nucleic acid sequence shown in SEQ ID 
NO: 17, or 

b) a nucleic acid sequence which, owing to the degeneracy of the genetic 
5 code, can be deduced from the amino acid sequence shown in SEQ ID 

NO:18 by backtranslating, or 

c) a functional equivalent of nucleic acid sequence shown in SEQ ID NO: 17 
which has an identity with SEQ ID NO: 17 of has at least 67%. 

10 

6. A polypeptide with the activity of a nuclear encoded Clp-protease, encoded by a 
nucleic acid molecule as claimed in claim 3, 4 or 5. 

7. An expression cassette comprising 

a) genetic control sequences in operable linkage with a nucleic acid sequence 
as claimed in claim 3, 4 or 5; or 

b) additional functional elements, or 

c) a combination of a) and b). 

8. A vector comprising an expression cassette as claimed in claim 7. 

25 9. A transgenic organism comprising at least one nucleic acid sequence as claimed 
in claim 4, 5 or 6 encoding a polypeptide with the activity of a Clp-protease, an 
expression cassette as claimed in claim 7 or a vector as claimed in claim 8, se- 
lected from among bacteria, yeasts, fungi, animal cells or plant cells. 

30 10. A method for identifying substances with herbicidal activity, comprising the follow- 
ing steps: 

i. bringing a nuclear encoded Clp-protease into contact with one or more test 
compounds under conditions which permit the test compound(s) to bind to 

35 the nucleic acid molecule encoding Clp-protease or to the nuclear encoded 

Clp-protease, and 

ii. detecting whether the test compound binds to the Clp-protease of i), or 



20 



40 



Hi. 



detecting whether the test compound reduces or blocks the enzymatic or 
biological activity of the Clp-protease of i), or 
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iv. detecting whether the test compound reduces or blocks the transcription, 
translation or expression of the Clp-protease of i). 

11. A method as claimed in claim 10, wherein the Clp-protease is 

a) selected from the group consisting of ClpP1 -protease, ClpP2-protease, 
ClpP3-protease, ClpP4-protease and ClpP6-protease; or 



b) selected from the group consisting of ClpR1 -protease, ClpR3-protease, 
10 ClpR4-protease; or 

c) ClpP-like-protease. 

12. A method as claimed in claim 10, wherein 



15 



25 



a) the ClpP1 -protease is encoded by a nucleic acid sequence which com- 
prises: 



i) a nucleic acid sequence with the nucleic acid sequence shown in 
20 SEQIDNO:1,or 

ii) a nucleic acid sequence which, owing to the degeneracy of the ge- 
netic code, can be deduced from the amino acid sequence shown in 
SEQ ID NO:2 by back translating, or 



a functional equivalent of nucleic acid sequence shown in SEQ ID 
NO:1 which has an identity with SEQ ID NO:1 of has at least 50%; 



b) the ClpP2-protease is encoded by a nucleic acid sequence which com- 
30 prises: 

i) a nucleic acid sequence with the nucleic acid sequence shown in 
SEQ ID NO:3, or 

35 ii) a nucleic acid sequence which, owing to the degeneracy of the ge- 

netic code, can be deduced from the amino acid sequence shown in 
SEQ ID NO:4 by back translating, or 



40 



iii) a functional equivalent of nucleic acid sequence shown In SEQ ID 
NO:3 which has an identity with SEQ ID NO:3 of has at least 50%; 
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c) . the ClpP3-protease is encoded by a nucleic acid sequence which com- 
prises: 

i) a nucleic acid sequence with the nucleic acid sequence shown in 
> SEQIDNO:5,or 

ii) a nucleic acid sequence which, owing to the degeneracy of the ge- 
netic code, can be deduced from the amino acid sequence shown in 
SEQ ID NO:6 by back translating, or 

iii) a functional equivalent of nucleic acid sequence shown in SEQ ID 
NO:5 which has an identity with SEQ ID NO:5 of has at least 50%; 



d) the ClpP4-protease is encoded by a nucleic acid sequence which com- 
15 prises: 

i) a nucleic acid sequence with the nucleic acid sequence shown in 
SEQ ID NO:7, or 

20 ii) a nucleic acid sequence which, owing to the degeneracy of the ge- 

netic code, can be deduced from the amino add sequence shown in 
SEQ ID NO:8 by back translating, or 

iii) a functional equivalent of nucleic acid sequence shown in SEQ ID 
25 NO:7 which has an identity with SEQ ID NO:7 of has at least 50%; 

e) the ClpP6~protease is encoded by a nucleic acid sequence which com- 
prises: 

30 i) a nucleic acid sequence with the nucleic acid sequence shown in 

SEQIDNO:9, or 

ii) a nucleic acid sequence which, owing to the degeneracy of the ge- 
netic code, can be deduced from the amino acid sequence shown in 

35 SEQ ID NO: 10 by back translating, or 

iii) a functional equivalent of nucleic acid sequence shown in SEQ ID 
NO:9 which has an identity with SEQ ID NO:9 of has at least 50%; 



40 f) 



the ClpR1 -protease is encoded by a nucleic acid sequence which com 
prises: 
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i) a nucleic acid sequence with the nucleic acid sequence shown in 
SEQIDNO:11,or 

ii) a nucleic acid sequence which, owing to the degeneracy of the ge- 

5 netic code, can be deduced from the amino acid sequence shown in 

SEQ ID NO: 12 by back translating, or 

iii) a functional equivalent of nucleic acid sequence shown in SEQ ID 
NO:1 1 which has an identity with SEQ ID NO:1 1 of has at least 50%; 

g) the ClpR3-protease is encoded by a nucleic acid sequence which com- 
prises: 



i) a nucleic acid sequence with the nucleic acid sequence shown in 
15 SEQ ID NO: 13, or 

ii) a nucleic acid sequence which, owing to the degeneracy of the ge- 
netic code, can be deduced from the amino acid sequence shown in 
SEQ ID NO:14 by back translating, or 

20 

iii) a functional equivalent of nucleic acid sequence shown in SEQ ID 
NO: 13 which has an identity with SEQ ID NO: 13 of has at least 50%; 

h) the ClpR4-protease is encoded by a nucleic acid sequence which com- 
25 prises: 

i) a nucleic acid sequence with the nucleic acid sequence shown in 
SEQ IDNO:15, or 

30 ii) a nucleic acid sequence which, owing to the degeneracy of the ge- 

netic code, can be deduced from the amino acid sequence shown in 
SEQ ID NO: 16 by back translating, or 

iii) a functional equivalent of nucleic acid sequence shown in SEQ ID 
35 NO: 15 which has an identity with SEQ ID NO: 15 of has at least 50%; 

i) the CIpP like-protease is encoded by a nucleic acid sequence which com- 
prises: 



40 



i) a nucleic acid sequence with the nucleic acid sequence shown in 
SEQ IDNO:17, or 
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ii) a nucleic acid sequence which, owing to the degeneracy of the ge- 
netic code, can be deduced from the amino acid sequence shown in 
SEQ ID NO: 18 by back translating, or 

5 iii) a functional equivalent of nucleic acid sequence shown in SEQ ID 

NO: 17 which has an identity with SEQ ID NO: 17 of has at least 50%; 

13. A method as claimed in claim 10, 1 1 or 12, wherein a test compound is selected 
which reduces or blocks the enzymatic or biological activity of Clp-protease. 

14. A method as claimed in any of claims 10, 11, 12 or 13, wherein 

i. either Clp-protease is expressed in a transgenic organism or an organism 
which naturally contains Clp-protease is grown, 

ii. the Clp-protease of step i) is brought into contact with a test compound in 
the cell digest of the transgenic or nontransgenic organism, in partially puri- 
fied form or in homogeneously purified form, and 

20 iii. selecting a test compound which reduces or blocks the enzymatic activity of 

the Clp-protease of step a). 

15. A method as claimed in any of claims 10, 1 1, 12 or 13, which comprises the 
following steps: 



15 



25 



L generating a transgenic organism comprising a nucleic acid sequence en- 
coding Clp-protease, wherein Clp-protease is expressed recombinantly; 



ii. applying a test substance to the transgenic organism of i) and to a non- 
30 transgenic organism of the same genotype, 

iii. determining the growth or the viability of the transgenic plant and the non- 
transgenic plant after application of the test compound, and 

35 iv. selecting test substances which bring about a reduced growth of the non- 

transgenic plant in comparison with the growth of the transgenic plant. 



16. A method as claimed in claim 15, which is carried out in a plant organism, a 
cyanobacterium or proteobacterium. 

40 

17. A method for identifying substances with growth-regulatory activity, which com- 
prises the following steps: 
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i. generating a transgenic plant comprising a nucleic acid sequence Clp- 
protease, wherein Clp-protease is expressed recombinantly; 

ii. applying a test substance to the transgenic plant of i) and to a nontrans- 
genic plant of the same variety, 

iii. determining the growth or the viability of the transgenic plant and the non- 
transgenic plant after application of the test compound, and 

iv. selecting test substances which bring about a reduced growth of the non- 
transgenic plant in comparison with the growth of the transgenic plant. 



18. A method as claimed in any of claims 10 to 17, wherein the substances are iden- 
15 tified in high-throughput screening method. 

1 9. A support comprising one or more of the nucleic acid molecules as claimed in 
claim 3, 4, or 5 one or more expression cassettes as claimed in claim 7, one or 
more vectors as claimed in claim 8, one or more organisms as claimed in claim 9 

20 or one or more (polypeptides as claimed in claim 6. 

20. A method as claimed in any of claims 10 to 18, wherein the substances are iden- 
tified in High-Throughput Screening using a support as claimed in claim 19. 



25 21. 



The use of a compound with herbicidal activity, identified by one of the methods 
as claimed in any of claims 10 to 16, 18 and 20 for controlling undesired vegeta- 
tion and/or for regulating the growth of plants. 



22. The use of a compound with growth-regulatory activity, identified by the method 
30 as claimed in any of claims 1 7, 1 8 or 20 for controlling undesired vegetation 

and/or for regulating the growth of plants. 

23. A method for the preparation of an agrochemical composition, which comprises 

35 a ) identifying a compound with herbicidal activity by one of the methods as 

claimed in any of claims 10 to 16, 18 and 20 or a compound with growth- 
regulatory activity as claimed in any of claims 17, 18 or 20, and 



b) 

40 



formulating this compound together with suitable auxiliaries to give crop 
protection products with herbicidal or growth-regulatory activity. 
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60 

24. The use of at least one Clp-protease inhibitor identified by one of the methods as 
claimed in any of claims 10 to 16, 18 and 20 in a method for controlling undesired 
vegetation and/or for regulating the growth of plants. 

5 25. A method for controlling undesired vegetation and/or for regulating the growth of 
plants comprising treating said weeds with a herbicide, wherein said herbicide is 
a compound which is a inhibitor of a Clp-protease. 



10 



15 



26. Clp-protease inhibitor of the formula: 

CI CI 



/ 



formula (I) 



or 



20 



25 



30 



35 



or 




N ^Cl 



formula (II) 




formula (III). 
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SEQUENCE LISTING 



PCT/EP2004/013555 



10 



15 



20 



30 



40 



<110> BASF Aktiengesellschaf t 



<120> Cip-protease as target for herbicides 



<130> 20030949 



<160> 33 



<170> Patentln version 3.1 



25 <210> 1 

<211> 591 

<212> DNA 

<213> Arabidopsis thaliana 



35 <220> 

<221> CDS 
<222> (1) . . (591) 
<223> 



45 <400> 1 

atg cct att ggc gtt cca aaa gta cct ttt cga agt cct gga gaa gga 48 

Met Pro lie Gly Val Pro Lys Val Pro Phe Arg Ser Pro Gly Glu Gly 
1 5 10 15 

50 gat aca tct tgg gtt gac ata tac aac cga ctt tat cga gaa aga tta 96 
Asp Thr Ser Trp Val Asp lie Tyr Asn Arg Leu Tyr Arg Glu Arg Leu 
20 25 30 

ttt ttt tta ggc caa gag gtt gat acc gaa ate teg aat caa ctt att 144 
55 Phe Phe Leu Gly Gin Glu Val Asp Thr Glu lie Ser Asn Gin Leu lie 
35 40 45 

agt ctt atg ata tat etc agt ata gaa aag gat acc aaa gat ctt tat 192 
Ser Leu Met lie Tyr Leu Ser He Glu Lys Asp Thr Lys Asp Leu Tyr 
60 50 55 " 60 

ttg ttt ata aac tct cct ggt gga tgg gta ata tct gga atg get att 240 
Leu Phe He Asn Ser Pro Gly Gly Trp Val He Ser Gly Met Ala He 
65 70 75 80 

65 

tat gat act atg caa ttt gtg cga ccc gat gta cag aca ata tgc atg 288 



1 'I 

WO 2005/054283 P<5¥/SK00l?0T3555 J 



Tyr Asp Thr Met Gin Phe Val Arg Pro Asp Val |^ggf#$0) Q I Jty® 

85 90 „ 95 

gga ttg gcc get tea ata gca tec ttt ate eta gtc gga gga gca att 336 
5 Gly Leu Ala Ala Ser lie Ala Ser Phe lie Leu Val Gly Gly Ala lie 
100 105 110 

ace aaa cgt ata gca ttc cct cac get agg gta atg ate cat caa ccc 384 
Thr Lys Arg lie Ala Phe Pro His Ala Arg Val Met He His Gin Pro 
10 115 120 125 

get agt teg ttt tat gag gca caa acg gga gaa ttt ate ttg gaa gcg 432 
Ala Ser Ser Phe Tyr Glu Ala Gin Thr Gly Glu Phe He Leu Glu Ala 
130 135 140 



15 



gaa gaa tta ctt aaa ctt cgc gaa acc ate aca agg gtt tat gta caa 480 
Glu Glu Leu Leu Lys Leu Arg Glu Thr He Thr Arg Val Tyr Val Gin 
145 150 155 160 



20 aga acg ggc aaa cct ata tgg gtt ata tec gaa gac atg gaa egg gat 528 
Arg Thr Gly Lys Pro He Trp Val He Ser Glu Asp Met Glu Arg Asp 
165 170 175 

gtt ttt atg tea gca aca gaa gcc caa get cat gga att gtt gat ctt 576 
25 Val Phe Met Ser Ala Thr Glu Ala Gin Ala His Gly He Val Asp Leu 
180 185 190 

gta gcg gtt caa taa 591 
Val Ala Val Gin 
30 195 

<210> 2 

35 <211> 196 

<212> PRT 

<213> Arabidopsis thaliana 

40 

<400> 2 

45 Met Pro He Gly Val Pro Lys Val Pro Phe Arg Ser Pro Gly Glu Gly 
1 5 10 15 

Asp Thr Ser Trp Val Asp He Tyr Asn Arg Leu Tyr Arg Glu Arg Leu 
50 20 25 30 



55 



Phe Phe Leu Gly Gin Glu Val Asp Thr Glu He Ser Asn Gin Leu He 
35 40 45 



Ser Leu Met He Tyr Leu Ser He Glu Lys Asp Thr Lys Asp Leu Tyr 
50 55 60 



60 



Leu Phe He Asn Ser Pro Gly Gly Trp Val He Ser Gly Met Ala He 
65 70 75 80 
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3 

Tyr Asp Thr Met Gin Phe Val Arg Pro Asp Val Gin Thr lie Cys Met. 

85 90 95 

5 Gly Leu Ala Ala Ser lie Ala Ser Phe lie Leu Val Gly Gly Ala lie 
100 105 110 

Thr Lys Arg lie Ala Phe Pro His Ala Arg Val Met lie HiB Gin Pro 
10 115 120 125 



15 



20 



Ala Ser Ser Phe Tyr Glu Ala Gin Thr Gly Glu Phe lie Leu Glu Ala 
13 0 135 140 



Glu Glu Leu Leu Lys Leu Arg Glu Thr lie Thr Arg Val Tyr Val Gin 
145 150 155 160 



Arg Thr Gly Lys Pro lie Trp Val lie Ser Glu Asp Met Glu Arg Asp 
165 170 175 



25 Val Phe Met Ser Ala Thr Glu Ala Gin Ala His Gly lie Val Asp Leu 
180 185 190 

Val Ala Val Gin 
30 195 

<210> 3 
35 <211> 1024 
<212> DNA 

<213> Nicotiana tabacum 

40 

<220> 
45 <221> CDS 

<222> (11).. (877) 
<223> 

50 

<400> 3 

gcggccgcta atg gcg gtc act ttt ccg acc acc tct tec teg tat eta 49 
55 Met Ala Val Thr Phe Pro Thr Thr Ser Ser Ser Tyr Leu 

15 10 

cac teg aga act aaa gtc cct cag cct tct tta age tgc gec age aaa 97 
His Ser Arg Thr Lys Val Pro Gin Pro Ser Leu Ser Cys Ala Ser Lys 
60 15 20 25 



gtt ttt gtc gga tta aga age caa tct cct aat tct tat ggg att gca 145 
Val Phe Val Gly Leu Arg Ser Gin Ser Pro Asn Ser Tyr Gly He Ala 
30 35 40 ~ 45 
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4 
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gcg tct aat gta aat gtt gaa ttt cac aat aga gtg tac aga agt att 193 
Ala Ser Asn Val Asn Val Glu Phe His Asn Arg Val Tyr Arg Ser lie 
50 55 60 

5 

gaa tec gga act aga gac agt aaa cca aca cgt gta cga gtt tec atg 241 
Glu Ser Gly Thr Arg Asp Ser Lys Pro Thr Arg Val Arg Val Ser Met 
65 70 75 

10 atg ccc att ggg aca cca aga gta ccc tac aga aat cca act gag gga 289 
Met Pro lie Gly Thr Pro Arg Val Pro Tyr Arg Asn Pro Thr Glu Gly 
80 85 90 

aca tgg cag tgg gtt gat ttg tgg aat get ctt tac cgt gaa cgt gtt 337 
15 Thr Trp Gin Trp Val Asp Leu Trp Asn Ala Leu Tyr Arg Glu Arg Val 
95 100 105 

att ttc ate gga caa cac ata gat gaa gaa ttt age aac cag ata ttg 385 
lie Phe lie Gly Gin His lie Asp Glu Glu Phe Ser Asn Gin lie Leu 
20 110 115 120 125 

gca aca atg ctg tat ctt gac agt att gat gat tec aag aag etc tac 433 
Ala Thr Met Leu Tyr Leu Asp Ser lie Asp Asp Ser Lys Lys Leu Tyr 
130 135 140 



25 



45 



ctg tat ate aat ggc cct ggt ggt gat eta act cca age atg gee ate 481 
Leu Tyr lie Asn Gly Pro Gly Gly Asp Leu Thr Pro Ser Met Ala lie 
145 150 155 



30 tac gac aca atg caa agt ctt aaa agt get gtt ggc ace cat tgt gtg 529 
Tyr Asp Thr Met Gin Ser Leu Lys Ser Ala Val Gly Thr His Cys Val 
160 165 170 

ggc tat gec tac aat ctt gee ggt ttt ctt ctt get get gga gaa aag 577 
35 Gly Tyr Ala Tyr Asn Leu Ala Gly Phe Leu Leu Ala Ala Gly Glu Lys 
175 180 185 

ggc aat cga ttt gca atg cct ctt tea agg att gca eta caa tct cca 625 
Gly Asn Arg Phe Ala Met Pro Leu Ser Arg lie Ala Leu Gin Ser Pro 
40 190 ~ 195 200 205 

get gga get gcg cgc gga cag get gat gat att cgc aat gaa gca gat 673 
Ala Gly Ala Ala Arg Gly Gin Ala Asp Asp lie Arg Asn Glu Ala Asp 
210 215 220 



gaa ctt etc aga att aga gat tac ctt ttc aag gag ttg get gag aag 721 
Glu Leu Leu Arg lie Arg Asp Tyr Leu Phe Lys Glu Leu Ala Glu Lys 
225 230 235 



50 aca ggc cag cct gtt gaa aag gtt cac aag gat tta agt egg atg aag 769 

Thr Gly Gin Pro Val Glu LyB Val His Lys Asp Leu Ser Arg Met Lys 
240 245 250 

cga etc aat get caa gaa get ctt gaa tat ggt ctt ata gac cgt ata 817 

55 Arg Leu Asn Ala Gin Glu Ala Leu Glu Tyr Gly Leu lie Asp Arg He 
255 260 265 

gtt agg cct ccc cgt att aag gca gat get cca cga aag gat ace aca 865 

Val Arg Pro Pro Arg He Lys Ala Asp Ala Pro Arg Lys Asp Thr Thr 
60 270 275 280 285 



gca ggt ctt ggt tagtccatac acategtata atttatggct gatagtggtt 
Ala Gly Leu Gly 



917 
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5 
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gtacgacttg cagtgttatt ttgcaatttc ttttgtttaa tctacatatt gaactctttt 977 
gatctactta ttcaaaaaca tgaaatcctg agcagactag cggccgc 1024 

5 

<210>. 4 
<211> 289 

10 

<212> PRT 

<213> Nicotiana tabacum 

15 

<400> 4 

Met Ala Val Thr Phe Pro Thr Thr Ser Ser Ser Tyr Leu His Ser Arg 
20 1 5 10 15 

Thr Lys Val Pro Gin Pro Ser Leu Ser Cys Ala Ser Lys Val Phe Val 
20 25 30 

25 

Gly Leu Arg Ser Gin Ser Pro Asn Ser Tyr Gly lie Ala Ala Ser Asn 
35 40 45 

30 

Val Asn Val Glu Phe His Asn Arg Val Tyr Arg Ser lie Glu Ser Gly 
50 55 " 60 

35 Thr Arg Asp Ser Lys Pro Thr Arg Val Arg Val Ser Met Met Pro He 
65 70 75 80 

Gly Thr Pro Arg Val Pro Tyr Arg Asn Pro Thr Glu Gly Thr Trp Gin 
40 85 90 ^ 95 

Trp Val Asp Leu Trp Asn Ala Leu Tyr Arg Glu Arg Val He Phe He 
100 105 110 

45 

Gly Gin His He Asp Glu Glu Phe Ser Asn Gin He Leu Ala Thr Met 
115 120 125 

50 

Leu Tyr Leu Asp Ser He Asp Asp Ser Lys Lys Leu Tyr Leu Tyr He 
130 135 140 

55 Asn Gly Pro Gly Gly Asp Leu Thr Pro Ser Met Ala He Tyr Asp Thr 
145 150 155 160 

Met Gin Ser Leu Lys Ser Ala Val Gly Thr His Cys Val Gly Tyr Ala 
60 165 170 ~ 175 



Tyr Asn Leu Ala Gly Phe Leu Leu Ala Ala Gly Glu Lys Gly Asn Arg 
180 185 ^ ^ 190 
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10 



Phe Ala Met Pro Leu Ser Arg lie Ala Leu Gin Ser Pro Ala Gly Ala 
195 200 205 



Ala Arg Gly Gin Ala Asp Asp lie Arg Asn Glu Ala Asp Glu Leu Leu 
210 * 215 220 



Arg lie Arg Asp Tyr Leu Phe Lys Glu Leu Ala Glu Lys Thr Gly Gin 
225 230 235 240 



15 Pro Val Glu Lys Val His Lys Asp Leu Ser Arg Met Lys Arg Leu Asn 

245 250 255 

Ala Gin Glu Ala Leu Glu Tyr Gly Leu lie Asp Arg lie Val Arg Pro 
20 260 265 270 

Pro Arg lie Lys Ala Asp Ala Pro Arg Lys Asp Thr Thr Ala Gly Leu 
275 280 ~ 285 

25 

Gly 

30 

<210> 5 
<211> 1124 
35 <212> DMA 

<213> Arabidopsis thaliana 

40 

<220> 

<221> CDS 
45 <222> (2).. (931) 
<223> 

50 

<400> 5 

a atg gag atg agt ttg cgt etc get tea tct tea ace tea aac cca att 49 
Met Glu Met Ser Leu Arg Leu Ala Ser Ser Ser Thr Ser Asn Pro lie 
15 10 15 

55 

tgt eta eta aac cct gga aaa aac ctt aat ttc cca ate cga aac cat 97 
Cys Leu Leu Asn Pro Gly Lys Asn Leu Asn Phe Pro lie Arg Asn His 
20 25 30 

60 aga ate cct aaa act teg aaa ccc ttt tgc gtt agg tct tea atg age 145 
Arg He Pro Lys Thr Ser Lys Pro Phe Cys Val Arg Ser Ser Met Ser 
35 40 45 



ttg tct aaa cca ccc aga caa acc tta tct agt aac tgg gat gta tct 



193 
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7 

Leu Ser Lys Pro Pro Arg Gin Thr Leu Ser Ser Asn Trp Asp Val Ser. 
50 55 60 

age ttc tec att gat tec gtt get caa tct cct. tea aga etc cca agt 241 

5 Ser Phe Ser lie Asp Ser Val Ala Gin Ser Pro Ser Arg Leu Pro Ser 

65 70 75 80 

ttc gaa gaa etc gat acc acc aac atg ttg etc cgt caa aga ate gtc 289 

Phe Glu Glu Leu Asp Thr Thr Asn Met Leu Leu Arg Gin Arg lie Val 
10 85 90 95 

ttt ttg ggt tct cag gtt gat gat atg acg gcg gat ttg gtt ata agt 337 

Phe Leu Gly Ser Gin Val Asp Asp Met Thr Ala Asp Leu Val lie Ser 
100 105 ~ 110 



15 



35 



55 



cag eta ttg tta eta gat get gag gac tea gaa aga gac att aeg ett 385 
Gin Leu Leu Leu Leu Asp Ala Glu Asp Ser Glu Arg Asp lie Thr Leu 
115 120 125 



20 ttt ate aat tea ccc ggt gga tct att act get ggg atg gga ata tat 433 
Phe lie Asn Ser Pro Gly Gly Ser He Thr Ala Gly Met Gly He Tyr 
130 135 140 

gat gca atg aaa caa tgt aag gcg gat gta tct act gtt tgc tta ggg 481 
25 Asp Ala Met Lys Gin Cys Lys Ala Asp Val Ser Thr Val Cys Leu Gly 
145 150 155 * 160 

tta get gca tct atg ggt gcg ttt ctt ctt get tct ggt tea aaa ggg 529 
Leu Ala Ala Ser Met Gly Ala Phe Leu Leu Ala Ser Gly Ser Lys Gly 
30 165 170 175 

aaa egg tat tgt atg cct aac tct aaa gtt atg ate cat cag cca ctt 577 
Lys Arg Tyr Cys Met Pro Asn Ser Lys Val Met He His Gin Pro Leu 
180 185 190 



ggt act get gga ggc aaa gca acg gaa atg age ata cgt ata aga gaa 625 
Gly Thr Ala Gly Gly Lys Ala Thr Glu Met Ser He Arg He Arg Glu 
195 200 205 



40 atg atg tac cac aag att aaa ctt aac aaa ate ttc tct aga ate act 673 
Met Met Tyr His Lys He Lys Leu Asn Lys He Phe Ser Arg He Thr 
210 215 220 

ggg aag cct gaa tea gag ate gaa agt gac aca gac cgt gat aac ttc 721 
45 Gly Lys Pro Glu Ser Glu He Glu Ser Asp Thr Asp Arg Asp Asn Phe 
225 230 235 240 

ttg aat cca tgg gag gcg aaa gaa tat ggt ttg ate gac get gta ate 769 
Leu Asn Pro Trp Glu Ala Lys Glu Tyr Gly Leu He Asp Ala Val lie 
50 245 250 ~ 255 

gat gat ggg aaa ccg gga eta ate get cca att gga gat ggt act cct 817 
Asp Asp Gly Lys Pro Gly Leu He Ala Pro He Gly Asp Gly Thr Pro 
260 265 270 



cct cct aaa acc aaa gtc tgg gat ctt tgg aaa gtc gaa gga acc aag 865 
Pro Pro Lys Thr Lys Val Trp Asp Leu Trp Lys Val Glu Gly Thr Lys 
275 280 285 



60 aaa gac aac act aac ttg cca tct gag cgc tec atg aca cag aat ggt 913 
Lys Asp Asn Thr Asn Leu Pro Ser Glu Arg Ser Met Thr Gin Asn Gly 
290 295 " 300 

tat gee gee att gaa tag aactgttgtt geagegttta cgecttttat 961 
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Tyr Ala Ala lie Glu 
305 

atgttattct ggtggtacct gtaaccatat aacgttgcat ttcctgtgtt tgtaccattt 1021 

5 

ctctgataga ttttggaata atttgaaggc aaagatagat tattgtgtag agagctacaa 1081 
atttaatgat aaattgatca tcagcactgg aaagctaaaa aaa 1124 

10 

<210> 6 
<211> 309 
15 <212> PRT 

<213> Arabidopsis thaliana 

20 

<400> 6 

Met Glu Met Ser lieu Arg Leu Ala Ser Ser Ser Thr Ser Asn Pro lie 
15 10 15 

25 

Cys Leu Leu Asn Pro Gly Lys Asn Leu Asn Phe Pro lie Arg Asn His 
20 25 30 

30 

Arg lie Pro Lys Thr Ser Lys Pro Phe Cys Val Arg Ser Ser Met Ser 
35 40 45 

35 Leu Ser Lys Pro Pro Arg Gin Thr Leu Ser Ser Asn Trp Asp Val Ser 
50 55 60 

Ser Phe Ser lie Asp Ser Val Ala Gin Ser Pro Ser Arg Leu Pro Ser 
40 65 70 75 80 

Phe Glu Glu Leu Asp Thr Thr Asn Met Leu Leu Arg Gin Arg lie Val 
85 90 95 

45 

Phe Leu Gly Ser Gin Val Asp Asp Met Thr Ala Asp Leu Val lie Ser 
100 105 110 

50 

Gin Leu Leu Leu Leu Asp Ala Glu Asp Ser Glu Arg Asp lie Thr Leu 
115 120 125 

55 Phe lie Asn Ser Pro Gly Gly Ser He Thr Ala Gly Met Gly He Tyr 
130 135 140 

Asp Ala Met Lys Gin Cys Lys Ala Asp Val Ser Thr Val Cys Leu Gly 
60 145 150 155 160 



Leu Ala Ala Ser Met Gly Ala Phe Leu Leu Ala Ser Gly Ser Lys Gly 
165 170 " 175 
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Lys Arg Tyr Cys Met Pro Asn Ser Lys Val Met lie His Gin Pro Leu 
180 185 190 



10 



Gly Thr Ala Gly Gly Lys Ala Thr Glu Met Ser He Arg He Arg Glu 
195 200 205 



Met Met Tyr His Lys He Lys Leu Asn Lys He Phe Ser Arg He Thr 
210 215 220 



15 Gly Lys Pro Glu Ser Glu He Glu Ser Asp Thr Asp Arg Asp Asn Phe 
225 230 235 " 240 

LeU ASX1 Pr ° Trp GlU Ala LyS Glu Tyr Gly Leu Ile As P Ala Val Ile 
20 245 250 255 

Asp Asp Gly Lys Pro Gly Leu Ile Ala Pro lie Gly Asp Gly Thr Pro 
25 260 265 270 

Pro Pro Lys Thr Lys Val Trp Asp Leu Trp Lys Val Glu Gly Thr Lys 
275 280 285 



30 



Lys Asp Asn Thr Asn Leu Pro Ser Glu Arg Ser Met Thr Gin Asn Gly 
290 295 300 



35 Tyr Ala Ala Ile Glu 
305 



<210> 7 

40 

<211> 1183 

<212> DNA 

45 <213> Arabidopsis thaliana 



<220> 

50 

<221> CDS 
<222> (3).. (902) 
55 <223> 



<400> 7 

60 ct ttc ttc ttc ttc get tea gec atg gga acc eta tct etc tea tct 47 
Phe Phe Phe Phe Ala Ser Ala Met Gly Thr Leu Ser Leu Ser Ser 
15 10 15 



tct etc aaa cct tea etc gtt tea tea aga etc aat tea tct tec tec 



95 
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Ser Leu Lys Pro Ser Leu Val Ser Ser Arg Leu Asn Ser Ser Ser Ser 
20 25 30 

gca tct tct tct teg ttt cct aaa cca aac aat etc tac etc aaa ccc 143 
5 Ala Ser Ser Ser Ser Phe Pro Lys Pro Asn Asn Leu Tyr Leu Lys Pro 
35 40 45 

acc aaa etc att tea cca cct etc aga aca act tea cca teg cca ttg 191 
•Thr Lys Leu lie Ser Pro Pro Leu Arg Thr Thr Ser Pro Ser Pro Leu 
10 50 55 60 

aga ttc gee aat get tea ate gag atg teg cag aca cag gaa tea get 239 
Arg Phe Ala Asn Ala Ser lie Glu Met Ser Gin Thr Gin Glu Ser Ala 
65 70 75 



15 



35 



55 



att cgc gga get gaa tct gac gtc atg ggt ctt etc ctt agg gaa cga 287 
lie Arg Gly Ala Glu Ser Asp Val Met Gly Leu Leu Leu Arg Glu Arg 
80 85 90 95 



20 ate gtc ttt etc ggt agt agt ate gac gat ttc gtc get gat get att 335 
lie Val Phe Leu Gly Ser Ser He Asp Asp Phe Val Ala Asp Ala He 
100 105 110 

atg agt cag ttg ctt etc tta gat get aaa gat cca aag aaa gat ate 383 
25 Met Ser Gin Leu Leu Leu Leu Asp Ala Lys Asp Pro Lys Lys Asp He 
115 120 125 

aaa etc ttt ate aat tct cct ggt ggt tct etc agt gca acc atg get " 431 
Lys Leu Phe He Asn Ser Pro Gly Gly Ser Leu Ser Ala Thr Met Ala 
30 130 135 140 

ata tac gat gtg gtt cag ctt gtg aga get gat gtt teg acg att get 479 
He Tyr Asp Val Val Gin Leu Val Arg Ala Asp Val Ser Thr He Ala 
145 150 155 



ctt ggc att get gca tea aca get teg att att ctt ggt gcg gga act 527 
Leu Gly He Ala Ala Ser Thr Ala Ser He He Leu Gly Ala Gly Thr 
160 165 170 175 



40 aaa ggc aag cgc ttt get atg ccc aac acg agg ata atg att cat caa 575 
Lys Gly Lys Arg Phe Ala Met Pro Asn Thr Arg He Met He His Gin 
180 185 190 

cct ctt gga ggt gca age ggt caa get ata gat gtt gag att caa get 623 
45 Pro Leu Gly Gly Ala Ser Gly Gin Ala He Asp Val Glu He Gin Ala 
195 200 205 

aag gaa gtt atg cat aac aag aac aat gtc acc age att ate gcg gga 671 
Lys Glu Val Met His Asn Lys Asn Asn Val Thr Ser He He Ala Gly 
50 * 210 215 220 

tgt act agt cga teg ttt gag cag gtt ctg aaa gat att gat agg gac 719 
Cys Thr Ser Arg Ser Phe Glu Gin Val Leu Lys Asp He Asp Arg Asp 
225 230 235 



egg tac atg tct cca att gaa gca gtt gag tat ggt tta att gat gga 767 
Arg Tyr Met Ser Pro He Glu Ala Val Glu Tyr Gly Leu He Asp Gly 
240 245 250 255 



60 gtt att gat gga gac age att att cct ctt gaa cct gtt cct gat aga 815 
Val He Asp Gly Asp Ser He He Pro Leu Glu Pro Val Pro Asp Arg 
260 265 270 

gtg aaa ccg aga gta aac tac gag gag att age aag gat ccg atg aaa 863 
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Val Lys Pro Arg Val Asn Tyr Glu Glu lie Ser Lys Asp Pro Met Lys. 
275 280 285 

ttc ttg act ccc gag ata cct gat gat gag ate tac taa agccaagctc 912 
5 Phe Leu Thr Pro Glu lie Pro Asp Asp Glu lie Tyr 
290 295 

gtctagaagc agggatcttc aaatgtgact aagactagca gtttcgagga aaagctcaat 972 

10 ttcttctgcg gttactggta ttggctttgc gaaaccgaag ctggtagtac ttggcttttg 1032 

tatctcatat ttcagttgtt cagaaaataa ttgttcttta aatcactctg ttttgaggaa 1092 

aatgacttaa agaagctgta gttatctcgt ttatgacaat cccttcaagt gtttaatgga 1152 

ttcaagaagt atcagtcagt atttttgtgg t 1183 



15 



20 



<210> 8 

<211> 299 

<212> PRT 

25 <213> Arabidopsis thaliana 



30 



45 



50 



<400> 8 

Phe Phe Phe Phe Ala Ser Ala Met Gly Thr Leu Ser Leu Ser Ser Ser 
15 10 15 



35 Leu Lys Pro Ser Leu Val Ser Ser Arg Leu Asn Ser Ser Ser Ser Ala 
20 25 30 

Ser Ser Ser Ser Phe Pro Lys Pro Asn Asn Leu Tyr Leu Lys Pro Thr 
40 35 40 45 



Lys Leu lie Ser Pro Pro Leu Arg Thr Thr Ser Pro Ser Pro Leu Arg 
50 55 60 



Phe Ala Asn Ala Ser lie Glu Met Ser Gin Thr Gin Glu Ser Ala lie 
65 70 75 80 



Arg Gly Ala Glu Ser Asp Val Met Gly Leu Leu Leu Arg Glu Arg He 
85 90 "* 95 



55 Val Phe Leu Gly Ser Ser He Asp Asp Phe Val Ala Asp Ala He Met 
100 105 " 110 

Ser Gin Leu Leu Leu Leu Asp Ala Lys Asp Pro Lys Lys Asp He Lys 
60 115 120 125 



Leu Phe He Asn Ser Pro Gly Gly Ser Leu Ser Ala Thr Met Ala He 
130 135 140 
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Tyr Asp Val Val Gin Leu Val Arg Ala Asp Val Ser Thr He Ala Leu 
_ 145 150 155 160 

O 

Gly He Ala Ala Ser Thr Ala Ser He He Leu Gly Ala Gly Thr Lys 
. 165 170 ~ 175 

10 

Gly Lys Arg Phe Ala Met Pro Asn Thr Arg He Met He His Gin Pro 
180 185 190 



15 Leu Gly Gly Ala Ser Gly Gin Ala He Asp Val Glu He Gin Ala Lys 
195 200 205 

Glu Val Met His Asn Lys Asn Asn Val Thr Ser He He Ala Gly Cys 
20 210 215 220 

Thr Ser Arg Ser Phe Glu Gin Val Leu Lys Asp He Asp Arg Asp Arg 
25 225 230 235 ~ 240 

Tyr Met Ser Pro He Glu Ala Val Glu Tyr Gly Leu He Asp Gly Val 
245 250 255 

30 

He Asp Gly Asp Ser He He Pro Leu Glu Pro Val Pro Asp Arg Val 
260 265 270 

35 Lys Pro Arg Val Asn Tyr Glu Glu He Ser Lys Asp Pro Met Lys Phe 
275 280 285 

Leu Thr Pro Glu He Pro Asp Asp Glu He Tyr 
40 290 295 

<210> 9 

45 <211> 1056 

<212> DNA 

<213> Arabidopsis thaliana 

50 



<220> 
55 <221> CDS 

<222> (61) . . (876) 
<223> 

60 



<400> 9 

gagtaattta gcatctatcc acgcctgaac ccgaaaaact ctgaaagctg agctctggta 



60 
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atg gcg ggt tta gca att tea cct cct etc ggt ctt tec ttc tct tct 108 

Met Ala Gly Leu Ala lie Ser Pro Pro Leu Gly Leu Ser Phe Ser Ser 
1 5 10 15 

5 

cga act cga aac cct aaa ccc act tec ttt eta tct cac aat caa agg 156 

Arg Thr Arg Asn Pro Lys Pro Thr Ser Phe Leu Ser His Asn Gin Arg 
20 25 30 

10 aat cct ata aga cgt ata gtt tct get eta cag agt cca tat gga gat 204 
Asn Pro lie Arg Arg lie Val Ser Ala Leu Gin Ser Pro Tyr Gly Asp 
35 40 45 

tct ctg aaa get gga ctt tct agt aat gtt tct gga tec cca ata aag 252 
15 Ser Leu Lys Ala Gly Leu Ser Ser Asn Val Ser Gly Ser Pro lie Lys 
50 55 60 

att gac aac aag get cca aga ttt gga gtg ata gag gcg aaa aag gga 300 
He Asp Asn Lys Ala Pro Arg Phe Gly Val He Glu Ala Lys Lys Gly 
20 65 70 75 80 

aac ccc cca gta atg cct tea gtg atg acc cct gga gga cct tta gac 348 
Asn Pro Pro Val Met Pro Ser Val Met Thr Pro Gly Gly Pro Leu Asp 
85 90 95 



25 



45 



etc tct tct gtg tta ttc cgt aac cgc ata ate ttc ate ggg caa cca 396 
Leu Ser Ser Val Leu Phe Arg Asn Arg He He Phe He Gly Gin Pro 
100 105 110 



30 att aac gca cag gtt get cag cga gtc ata tct cag ctt gta acc ctt 444 
He Asn Ala Gin Val Ala Gin Arg Val He Ser Gin Leu Val Thr Leu 
115 120 125 

gca tct att gat gat aaa tec gac ate ctg atg tac ttg aat tgt ccc 492 
35 Ala Ser He Asp Asp Lys Ser Asp He Leu Met Tyr Leu Asn Cys Pro 
130 135 140 

SSft ggc agt act tac tec gtc eta aca att tat gac tgt atg tct tgg 540 
Gly Gly Ser Thr Tyr Ser Val Leu Thr He Tyr Asp Cys Met Ser Trp 
40 145 150 155 160 

ata aag cct aaa gtt gga aca gtg gcg ttt gga gta get gca age caa 588 
He Lys Pro Lys Val Gly Thr Val Ala Phe Gly Val Ala Ala Ser Gin 
165 170 175 



gga gca ttt ttt ctt get gga ggt gaa aaa gga atg cgt tat gca atg 636 
Gly Ala Phe Phe Leu Ala Gly Gly Glu Lys Gly Met Arg Tyr Ala Met 
180 185 190 



50 cca aat act cgt gtc atg ata cat caa cca caa act gga tgc gga gga 684 
Pro Asn Thr Arg Val Met He His Gin Pro Gin Thr Gly Cys Gly Gly 
195 200 205 

cat gta gag gac gtg agg aga cag gtc aat gaa gee ate gaa gee cga 732 
55 His Val Glu Asp Val Arg Arg Gin Val Asn Glu Ala He Glu Ala Arg 
210 215 220 

caa aaa att gac agg atg tat gca get ttc act gga caa cct ctg gag 780 
Gin Lys He Asp Arg Met Tyr Ala Ala Phe Thr Gly Gin Pro Leu Glu 
60 225 230 235 " 240 

aaa gtg cag caa tac act gaa aga gat cgt ttc tta tea gca tct gag 828 
Lys Val Gin Gin Tyr Thr Glu Arg Asp Arg Phe Leu Ser Ala Ser Glu 
245 250 255 
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gcg ttt gag ttc ggg etc att gat ggt eta ttg gaa aca gaa tac tga 876 
Ala Phe Glu Phe Gly Leu He Asp Gly Leu Leu Glu Thr Glu Tyr 
260 265 270 

5 

agcagcatac aggacaatgc acaacaacag etcattgeaa tgttcaaagc ttccattttc 936 
atttgaatat gaacggttgt aactgatatt tgtgcataaa tcagtttggt tttcttggtt 996 
10 ttattgtcta ctaaacagaa tgagaaaact aaactgttta tttttttact gaaaaatctg 1056 

<210> 10 

15 <211> 271 

<212> PRT 

<213> Arabidopsis thaliana 

20 

<400> 10 

25 Met Ala Gly Leu Ala He Ser Pro Pro Leu Gly Leu Ser Phe Ser Ser 
1 5 10 15 

Arg Thr Arg Asn Pro Lys Pro Thr Ser Phe Leu Ser His Asn Gin Arg 
30 20 25 30 

Asn Pro He Arg Arg He Val Ser Ala Leu Gin Ser Pro Tyr Gly Asp 
35 40 45 

35 

Ser Leu Lys Ala Gly Leu Ser Ser Asn Val Ser Gly Ser Pro He Lys 
50 55 60 

40 

He Asp Asn Lys Ala Pro Arg Phe Gly Val He Glu Ala Lys Lys Gly 
65 70 75 80 

45 Asn Pro Pro Val Met Pro Ser Val Met Thr Pro Gly Gly Pro Leu Asp 

85 90 " 95 

Leu Ser Ser Val Leu Phe Arg Asn Arg He He Phe He Gly Gin Pro 
50 100 105 110 

He Asn Ala Gin Val Ala Gin Arg Val lie Ser Gin Leu Val Thr Leu 
115 120 125 

55 

Ala Ser He Asp Asp Lys Ser Asp He Leu Met Tyr Leu Asn Cys Pro 
130 135 140 

60 

Gly Gly Ser Thr Tyr Ser Val Leu Thr He Tyr Asp Cys Met Ser Trp 
145 150 155 160 
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He Lys Pro Lys Val Gly Thr Val Ala Phe Gly Val Ala Ala Ser Gla 
165 170 175 



5 Gly Ala Phe Phe Leu Ala Gly Gly Glu Lys Gly Met Arg Tyr Ala Met 
180 185 190 

Pro Asn Thr Arg Val Met He His Gin Pro Gin Thr Gly Cys Gly Gly 
10 195 200 205 



15 



20 



30 



His Val Glu Asp Val Arg Arg Gin Val Asn Glu Ala He Glu Ala Arg 
210 215 220 



Gin Lys He Asp Arg Met Tyr Ala Ala Phe Thr Gly Gin Pro Leu Glu 
225 230 235 240 



Lys Val Gin Gin Tyr Thr Glu Arg Asp Arg Phe Leu Ser Ala Ser Glu 
245 250 255 



25 Ala Phe Glu Phe Gly Leu He Asp Gly Leu Leu Glu Thr Glu Tyr 
260 265 270 



<210> 11 

<211> 1448 

<212> DNA 

35 <213> Nicotiana tabacum 

<220> 

40 

<221> CDS 

<222> (2).. (1162) 
45 <223> 



<400> 11 

50 g egg ccg ctg get tct tct ttg ctt etc tct ccg ctt tct age teg acg 49 
Arg Pro Leu Ala Ser Ser Leu Leu Leu Ser Pro Leu Ser Ser Ser Thr 
15 10 15 

gtt act gaa aat cgc gag ctg ggt tct ggt aaa tea act ttc ata tec 97 
55 Val Thr Glu Asn Arg Glu Leu Gly Ser Gly Lys Ser Thr Phe He Ser 
20 25 30 

agt ccc aat ttc tec ttt gca act tct gtt cac agt tgc agg cca aac 145 
Ser Pro Asn Phe Ser Phe Ala Thr Ser Val His Ser Cys Arg Pro Asn 
60 35 40 45 

ggc gtt cga ggt tat tgt tac agg tct ccg gta get aag tct ttg gac 193 
Gly Val Arg Gly Tyr Cys Tyr Arg Ser Pro Val Ala Lys Ser Leu Asp 
50 55 60 
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cat ata ccc caa aaa ttc aga ctg gaa aat etc aaa gat gga eta ctg 241 

His lie Pro Gin Lys Phe Arg Leu Glu Asn Leu Lys Asp Gly Leu Leu 

65 70 75 ~ 80 

5 

gac aac tat aaa agt gec cct cag tat ctt tac ggc ctt agt cct tea 289 

Asp Asn Tyr Lys Ser Ala Pro Gin Tyr Leu Tyr Gly Leu Ser Pro Ser 
85 90 * 95 

10 cag atg gat atg ttc atg aca gaa gat aac cca gta egg cga cag tea 337 
Gin Met. Asp Met Phe Met Thr Glu Asp Asn Pro Val Arg Arg Gin Ser 
100 105 110 

gaa agt gee act gag gat agt ata tct tea gee aat aac tat ctg age 385 
15 Glu Ser Ala Thr Glu Asp Ser lie Ser Ser Ala Asn Asn Tyr Leu Ser 
115 120 125 

aat ggt gga atg tgg agt atg tec ggc atg aac gat egg ggc ccc teg 433 
Asn Gly Gly Met Trp Ser Met Ser Gly Met Asn Asp Arg Gly Pro Ser 
20 13 0 135 140 



25 



45 



aaa tac agt atg agt gtc age atg tac cgt gga gga aca aga gga tct 481 
Lys Tyr Ser Met Ser Val Ser Met Tyr Arg Gly Gly Thr Arg Gly Ser 
145 150 155 ' " " 160 

gga aga cct cga act gcg cct cct gat ttg cca tct ttg ctt ttg gat 529 
Gly Arg Pro Arg Thr Ala Pro Pro Asp Leu Pro Ser Leu Leu Leu Asp 
165 170 175 



30 get cga att gtc tat ctg ggc atg cct att gta cca get gtt aca gag 577 
Ala Arg lie Val Tyr Leu Gly Met Pro lie Val Pro Ala Val Thr Glu 
180 185 190 

ctt ctt gtt get cag tttatg tgg ttg gat tat gac aat cca tea aag 625 
35 Leu Leu Val Ala Gin Phe Met Trp Leu Asp Tyr Asp Asn Pro Ser Lys 
195 200 *" 205 

cct ata tac eta tat ata aac tea tea ggc aca cag aat gag aag atg 673 
Pro lie Tyr Leu Tyr He Asn Ser Ser Gly Thr Gin Asn Glu Lys Met 
40 210 215 220 

gag act gtt ggg tct gaa aca gag gca tat gec ate get gac aca atg 721 
Glu Thr Val Gly Ser Glu Thr Glu Ala Tyr Ala He Ala Asp Thr Met 
225 230 235 240 



gca tac tgc aaa tea gat ate tat aca gtg aac tgt ggc atg gca tat 769 
Ala Tyr Cys Lys Ser Asp He Tyr Thr Val Asn Cys Gly Met Ala Tyr 
245 250 255 



50 ggt caa gca gca atg ctt ctg tea ctg gga aag aag ggg ttc cgt get 817 
Gly Gin Ala Ala Met Leu Leu Ser Leu Gly Lys Lys Gly Phe Arg Ala 
260 265 ~ ^ 270 

atg cag cca aat tea tct aca aaa ttg tat tta cct aaa gtc age aaa 865 
55 Met Gin Pro Asn Ser Ser Thr Lys Leu Tyr Leu Pro Lys Val Ser Lys 
275 280 285 

tec agt gga gca gtg ata gat atg tgg ate agg gec aaa gaa eta gat 913 
Ser Ser Gly Ala Val He Asp Met Trp He Arg Ala Lys Glu Leu Asp 
60 290 295 300 

gca aac act gag tat tac ctt gaa eta tta gcg aaa gga gtt gga aaa 961 
Ala Asn Thr Glu Tyr Tyr Leu Glu Leu Leu Ala Lys Gly Val Gly Lys 
305 310 315 320 
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cca aag gaa gaa ate gag aaa gat att caa cgc cct aaa tat ctg egg 1009 
Pro Lys Glu Glu lie Glu Lys Asp He Gin Arg Pro Lys Tyr Leu Arg 
325 330 " 335 

gca caa gaa gec att gac tat ggc att gcg gac aag ata ate gat tea 1057 
Ala Gin Glu Ala He Asp Tyr Gly He Ala Asp Lys He He Asp Ser 
340 345 350 

10 aga gac aat gca ttt gag aaa agg aac tat ggt gag ata etc gec caa 1105 
Arg Asp Asn Ala Phe Glu Lys Arg Asn Tyr Gly Glu lie Leu Ala Gin 
355 360 365 

tct aga get atg agg aaa gec gga cca ggt get cag get get cca tct 1153 
15 Ser Arg Ala Met Arg Lys Ala Gly Pro Gly Ala Gin Ala Ala Pro Ser 
370 375 380 

ggc tec agg tgactggaag ageggtaatg gtcccaagct ttcaggaaca 1202 
Gly Ser Arg 
20 385 

actgttgttc ccttatagtt tcgaggaaca aagttgctgg ttacttggtc tgtgccggta 1262 

taatgtaact gggacaaaga acatattgta gaaaccttgt ttgagctgtg aagtataggg 1322 

gttttacaac tattatgeae aggtctgeaa agagtaccca taatgtcaat tggttgtacc 1382 

agtatcaaac aatcagatag tgccagtgta tggtataaat gaatatagat ctctctgagc 1442 

ggcege 1448 



25 



30 



<210> 12 

35 <211> 387 

<212> PRT 

<213> Nicotiana tabacum 

40 



<400> 12 

45 Arg Pro Leu Ala Ser Ser Leu Leu Leu Ser Pro Leu Ser Ser Ser Thr 
15 10 15 

Val Thr Glu Asn Arg Glu Leu Gly Ser Gly Lys Ser Thr Phe He Ser 
50 20 25 30 



55 



60 



Ser Pro Asn Phe Ser Phe Ala Thr Ser Val His Ser Cys Arg Pro Asn 
35 40 45 



Gly Val Arg Gly Tyr Cys Tyr Arg Ser Pro Val Ala Lys Ser Leu Asp 
50 55 60 



His He Pro Gin Lys Phe Arg Leu Glu Asn Leu Lys Asp Gly Leu Leu 
G5 70 75 QQ 
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Asp Asn Tyr Lys Ser Ala Pro Gin Tyr Leu Tyr Gly Leu Ser Pro Ser. 

85 90 95 



5 Gin Met Asp Met Phe Met Thr Glu Asp Asn Pro Val Arg Arg Gin Ser 
100 105 110 



Glu Ser Ala Thr Glu Asp Ser lie Ser Ser Ala Asn Asn Tyr Leu Ser 

10 115 120 125 

* 

Asn Gly Gly Met Trp Ser Met Ser Gly Met Asn Asp Arg Gly Pro Ser 
130 135 140 

15 

Lys Tyr Ser Met Ser Val Ser Met Tyr Arg Gly Gly Thr Arg Gly Ser 
145 150 155 . 160 

20 

Gly Arg Pro Arg Thr Ala Pro Pro Asp Leu Pro Ser Leu Leu Leu Asp 
165 170 175 



25 Ala Arg lie Val Tyr Leu Gly Met Pro lie Val Pro Ala Val Thr Glu 
180 185 190 



Leu Leu Val Ala Gin Phe Met Trp Leu Asp Tyr Asp Asn Pro Ser Lys 
30 195 200 205 

Pro lie Tyr Leu Tyr He Asn Ser Ser Gly Thr Gin Asn Glu Lys Met 
210 215 220 

35 

Glu Thr Val Gly Ser Glu Thr Glu Ala Tyr Ala He Ala Asp Thr Met 
225 230 235 240 

40 

Ala Tyr Cys Lys Ser Asp He Tyr Thr Val Asn Cys Gly Met Ala Tyr 
245 250 255 



45 Gly Gin Ala Ala Met Leu Leu Ser Leu Gly Lys Lys Gly Phe Arg Ala 
260 265 270 



Met Gin Pro Asn Ser Ser Thr Lys Leu Tyr Leu Pro Lys Val Ser Lys 
50 275 280 285 

Ser Ser Gly Ala Val He Asp Met Trp lie Arg Ala Lys Glu Leu Asp 
290 295 300 

55 

Ala Asn Thr Glu Tyr Tyr Leu Glu Leu Leu Ala Lys Gly Val Gly Lys 
305 310 315 320 

60 

Pro Lys Glu Glu He Glu Lys Asp He Gin Arg Pro Lys Tyr Leu Arg 
325 330 335 
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Ala Gin Glu Ala He Asp Tyr Gly He Ala Asp Lys He He Asp Ser 
340 345 350 

5 Arg Asp Asn Ala Phe Glu Lys Arg Asn Tyr Gly Glu He Leu Ala Gin 
355 360 365 

Ser Arg Ala Met Arg Lys Ala Gly Pro Gly Ala Gin Ala Ala Pro Ser 
10 370 375 380 



15 



20 



25 



30 



45 



Gly Ser Arg 
385 



<210> 13 

<211> 1246 

<212> DNA 

<213> Arabidopsis thaliana 
<220> 

<221> CDS 

<222> (38) • . (1030) 

<223> 



35 

<400> 13 

attttcgcga gcttccgtgt ccaagagctc ctcgacc atg gcg tct tgt tta caa 55 

Met Ala Ser Cys Leu Gin 
40 is 

gca tec atg aat tct ctg ctt cca cgc tct tct tct ttt tct cct cat 103 
Ala Ser Met Asn Ser Leu Leu Pro Arg Ser Ser Ser Phe Ser Pro His 
10 15 20 

cct cct eta tct teg aat tea tec ggg aga aga aac ttg aag act ttt 151 
Pro Pro Leu Ser Ser Asn Ser Ser Gly Arg Arg Asn Leu Lys Thr Phe 
25 30 35 



50 cgt tac gee ttt cgc gee aaa gee tct gec aaa ate cct atg cct ccg 199 
Arg Tyr Ala Phe Arg Ala Lys Ala Ser Ala Lys He Pro Met Pro Pro 
40 45 50 

ata aat cca aag gat cct ttc etc tec acg etc get tct att gee gcg 247 
55 He Asn Pro Lys Asp Pro Phe Leu Ser Thr Leu Ala Ser He Ala Ala 
55 60 65 70 

aat tct ccg gaa aag ctt etc aat egg ccg gtt aac get gat gtg ccg 295 
Asn Ser Pro Glu Lys Leu Leu Asn Arg Pro Val Asn Ala Asp Val Pro 
60 75 80 85 

cca tat ctt gac ate ttt gac tec cct cag etc atg tct tct cct gca 343 
Pro Tyr Leu Asp He Phe Asp Ser Pro Gin Leu Met Ser Ser Pro Ala 
90 95 100 
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cag gtt gaa aga tea gtg get tat aac gag cac cga ccg aga act cct 391 

Gin Val Glu Arg Ser Val Ala Tyx Asn Glu His Arg Pro Arg Thr Pro 

105 110 115 

5 

cca cca gac ttg cca tct atg ctt ctt gac ggg aga att gtt tac att 439 

Pro Pro Asp Leu Pro Ser Met Leu Leu Asp Gly Arg lie Val Tyr lie 
120 125 130 

10 gga atg cct ctt gtg ccg gca gtg act gag eta gtt gtc get gag eta 487 
Gly Met Pro Leu Val Pro Ala Val Thr Glu Leu Val Val Ala Glu Leu 
135 140 145 150 

atg tat ctt cag tgg ctg gat ccc aag gaa ccc att tac att tac ate 535 
15 Met Tyr Leu Gin Trp Leu Asp Pro Lys Glu Pro He Tyr He Tyr He 

155 160 165 

aac tec aca ggg acc act cgt gat gat gga gag acg gtt gga atg gaa 583 
Asn Ser Thr Gly Thr Thr Arg Asp Asp Gly Glu Thr Val Gly Met Glu 
20 170 175 180 

tea gaa ggg ttt gcg ate tat gac tct ttg atg caa ctt aaa aac gag 631 
Ser Glu Gly Phe Ala He Tyr Asp Ser Leu Met Gin Leu Lys Asn Glu 
185 190 195 



25 



45 



gta cat aca gta tgt gtg gga gca gee ata ggt cag gee tgt eta tta 679 
Val His Thr Val Cys Val Gly Ala Ala He Gly Gin Ala Cys Leu Leu 
200 205 210 



30 ctt tct gcg gga acc aag ggt aaa egg ttt atg atg cca cat gee aaa 727 
Leu Ser Ala Gly Thr Lys Gly Lys Arg Phe Met Met Pro His Ala Lys 
215 220 225 230 

gcg atg att cag caa cca cgt gta cct tct tct ggg ttg atg cct gee 775 
35 Ala Met He Gin Gin Pro Arg Val Pro Ser Ser Gly Leu Met Pro Ala 

235 240 245 

agt gat gtc ctg att egg gee aaa gag gtc att aca aat agg gat ata 823 
Ser Asp Val Leu He Arg Ala Lys Glu Val He Thr Asn Arg Asp lie 
40 250 255 260 

ctt gtg gaa eta eta tea aag cat act ggg aat tec gtg gag act gta 871 
Leu Val Glu Leu Leu Ser Lys His Thr Gly Asn Ser Val Glu Thr Val 
265 270 275 



get aac gta atg aga agg cca tat tac atg gat gca cca aaa get aaa 919 
Ala Asn Val Met Arg Arg Pro Tyr Tyr Met Asp Ala Pro Lys Ala Lys 
280 285 290 



50 gaa ttt gga gtc att gac agg att ctt tgg cgc ggt caa gaa aag att 967 
Glu Phe Gly Val He Asp Arg He Leu Trp Arg Gly Gin Glu Lys He 
295 300 305 310 

att gcg gac gtg gtt cct tea gag gaa ttc gac aag aat gca ggg att 1015 
55 He Ala Asp Val Val Pro Ser Glu Glu Phe Asp Lys Asn Ala Gly He 

315 320 325 

aaa age gta gta tga gtctagtctt aagttttctt ggectaaate atactgegtc 1070 
Lys Ser Val Val 
60 330 

atggagaaga acaaatagac tgaccaaaat cacattggcc gcagactgcc ttgtttcaaa 1130 

tcacttggta aatgtgaaca tgegattagg agaatcatac ttaaaggatc ttgaaatatt 1190 



I 
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atgataaaat tgtaatgtgt ttgttcgtta gcaatagtaa atacaatctt caactc 1246 

5 <210> 14 

<211> 330 

•<212> PRT 

10 

<213> Arabidopsis thaliana 



15 <400> 14 

Met Ala Ser Cys Leu Gin Ala Ser Met Asn Ser Leu Leu Pro Arg Ser 
15 10 15 

20 

Ser Ser Phe Ser Pro His Pro Pro Leu Ser Ser Asn Ser Ser Gly Arg 
20 25 30 

25 Arg Asn Leu Lys Thr Phe Arg Tyr Ala Phe Arg Ala Lys Ala Ser Ala 
35 40 45 

Lys He Pro Met Pro Pro He Asn Pro Lys Asp Pro Phe Leu Ser Thr 
30 50 55 60 

Leu Ala Ser He Ala Ala Asn Ser Pro Glu Lys Leu Leu Asn Arg Pro 
65 70 75 80 

35 

Val Asn Ala Asp Val Pro Pro Tyr Leu Asp He Phe Asp Ser Pro Gin 
85 90 95 

40 

Leu Met Ser Ser Pro Ala Gin Val Glu Arg Ser Val Ala Tyr Asn Glu 
100 105 ' 110 



45 His Arg Pro Arg Thr Pro Pro Pro Asp Leu Pro Ser Met Leu Leu Asp 
115 120 125 

Gly Arg He Val Tyr He Gly Met Pro Leu Val Pro Ala Val Thr Glu 
50 130 135 140 



55 



Leu Val Val Ala Glu Leu Met Tyr Leu Gin Trp Leu Asp Pro Lys Glu 
145 150 155 160 



60 



Pro He Tyr He Tyr He Asn Ser Thr Gly Thr Thr Arg Asp Asp Gly 
165 170 175 



Glu Thr Val Gly Met Glu Ser Glu Gly Phe Ala He Tyr Asp Ser Leu 
180 185 190 
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Met Gin Leu Lys Asn Glu Val His Thr Val Cys Val Gly Ala Ala lie 
195 200 205 



Gly Gin Ala Cys Leu Leu Leu Ser Ala Gly Thr Lys Gly Lys Arg Phe 
210 215 220 



Met Met Pro His Ala Lys Ala Met lie Gin Gin Pro Arg Val Pro Ser 
10 225 230 235 240 

Ser Gly Leu Met Pro Ala Ser Asp Val Leu He Arg Ala Lys Glu Val 
245 250 255 

15 

He Thr Asn Arg Asp He Leu Val Glu Leu Leu Ser Lys His Thr Gly 
260 265 270 

20 

Asn Ser Val Glu Thr Val Ala Asn Val Met Arg Arg Pro Tyr Tyr Met 
275 280 285 

25 Asp Ala Pro Lys Ala Lys Glu Phe Gly Val lie Asp Arg He Leu Trp 
290 295 300 

Arg Gly Gin Glu Lys He He Ala Asp • Val Val Pro Ser Glu Glu Phe 
30 305 310 315 320 

Asp Lys Asn Ala Gly He Lys Ser Val Val 
325 330 

35 

<210> 15 
<211> 1236 

40 

<212> DNA 

<213> Arabidopsis thaliana 

45 

<220> 

<221> CDS 

50 

<222> (66).. (983) 
<223> 

55 

<400> 15 

agatcgttat cgtttcgggg tcacagggac tttcactctt tctctctctc tgcaacaaag 60 

60 aagaa atg gag gta gca gca gcg act gcg acg age ttc aca acg ctt cga 110 
Met Glu Val Ala Ala Ala Thr Ala Thr Ser Phe Thr Thr Leu Arg 
1 5 10 15 

get cgt acg tea gcg att ate ccg tct tct aca cgt aat ctg aga tct 158 
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Ala Arg Thr Ser Ala lie lie Pro Ser Ser Thr Arg Asn Leu Arg Ser 
20 25 30 

aaa ccg aga ttt tct tea tct tea tct etc aga get tct ctt teg aat 206 

5 Lys Pro Arg Phe Ser Ser Ser Ser Ser Leu Arg Ala Ser Leu Ser Asn 

35 40 45 

ggc ttt ctt teg ccg tat acc gga gga age ate tct agt gac tta tgc 254 

Gly Phe Leu Ser Pro Tyr Thr Gly Gly Ser lie Ser Ser Asp Leu Cys 
10 50 55 60 

ggc get aag ctt cgt gcg gaa teg ctt aat ccg tta aat ttt tec agt 302 

Gly Ala Lys Leu Arg Ala Glu Ser Leu Asn Pro Leu Asn Phe Ser Ser 
65 70 75 



15 



35 



55 



tec aag cct aaa cgc gga gtt gtc act atg gtt ata cct ttc tea aag 350 
Ser Lys Pro Lys Arg Gly Val Val Thr Met Val lie Pro Phe Ser Lys 
80 85 90 95 



20 gga agt gca cac gaa caa cct cct cct gat ttg gca tea tat ttg ttc 398 

Gly Ser Ala His Glu Gin Pro Pro Pro Asp Leu Ala Ser Tyr Leu Phe 

100 105 110 

aag aac cga att gta tat ttg gga atg tct etc gta cct tea gtt act 446 

25 Lys Asn Arg lie Val Tyr Leu Gly Met Ser Leu Val Pro Ser Val Thr 

115 120 125 

gag ttg ata ctt gcg gag ttt ctt tac ctt cag tat gaa gac gag gaa 494 

Glu Leu lie Leu Ala Glu Phe Leu Tyr Leu Gin Tyr Glu Asp Glu Glu 

30 130 135 140 

aag cct att tac ctt tac ata aac teg act ggg aca acc aag aat ggt 542 

Lys Pro lie Tyr Leu Tyr lie Asn Ser Thr Gly Thr Thr Lys Asn Gly 

145 150 155 



gaa aag ttg ggc tat gat act gag get ttt gca ate tat gat gtc atg 590 
Glu Lys Leu Gly Tyr Asp Thr Glu Ala Phe Ala lie Tyr Asp Val Met 
160 165 170 175 



40 ggg tat gtc aaa cca cca ate ttt act ctt tgc gtc ggg aat gcg tgg 63 8 

Gly Tyr Val Lys Pro Pro lie Phe Thr Leu Cys Val Gly Asn Ala Trp 
180 185 190 

ggt gaa get get ttg ctt ctg act get ggt gca aaa gga aat cga tct 686 
45 Gly Glu Ala Ala Leu Leu Leu Thr Ala Gly Ala Lys Gly Asn Arg Ser 
195 200 205 

gcg ttg ccc tea tea act att atg ata aag cag ccc att get cga ttt 734 
Ala Leu Pro Ser Ser Thr lie Met lie Lys Gin Pro He Ala Arg Phe 
50 210 215 220 

caa ggc caa gca act gat gtt gaa att gca agg aaa gaa ate aag cac 782 
Gin Gly Gin Ala Thr Asp Val Glu He Ala Arg Lys Glu He Lys His 
225 230 235 



ata aag aca gaa atg gtc aag ctg tat tea aag cat att ggt aaa tec 830 
He Lys Thr Glu Met Val Lys Leu Tyr Ser Lys His He Gly Lys Ser 
240 245 250 255 



60 ccg gag cag att gaa get gac atg aaa cgc ccg aaa tat ttt agt ccc 878 
Pro Glu Gin He Glu Ala Asp Met Lys Arg Pro Lys Tyr Phe Ser Pro 
260 265 270 



act gag get gtt gaa tat ggg ate att gat aag gtg gtt tac aat gaa 



926 
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Thr Glu Ala Val Glu Tyr Gly He He Asp Lys Val Val Tyr Asn Glu 
275 280 ~ 285 

a 99 ggc age caa gac aga gga gtt gtg tct gac ctt aaa aag gca caa 974 
5 Arg Gly Ser Gin Asp Arg Gly Val Val Ser Asp Leu Lys Lys Ala Gin 
290 295 " 300 

etc att tga atgtcagaac tgtcttccga aatcccatga ttaacaggtt 1023 
Leu He 
10 305 

ggagatctta ccgctgatca aatggggaat cagtgaacca ttcaccggca cagaactgag 1083 
gtaaagtctg gaaaacatgt taaaaaaggt tactagtaat getgeaattg tagggttatt 1143 

15 

tgaacagaaa caaacccata tgtgtaggct tgtgaatgcc tagaaacagg attggtgtat 1203 
cttcaatata tgtttctaag atgaatcaat ttc 1236 

20 

<210> 16 
<211> 305 
25 <212> PRT 

<213> Arabidopsis thaliana 



30 



35 



40 



55 



60 



<400> 16 



Met Glu Val Ala Ala Ala Thr Ala Thr Ser Phe Thr Thr Leu Arg Ala 
15 10 15 

Arg Thr Ser Ala He He Pro Ser Ser Thr Arg Asn Leu Arg Ser Lys 
20 25 30 

Pro Arg Phe Ser Ser Ser Ser Ser Leu Arg Ala Ser Leu Ser Asn Gly 
35 40 45 

45 Phe Leu Ser Pro Tyr Thr Gly Gly Ser He Ser Ser Asp Leu Cys Gly 
50 55 60 

Ala Lys Leu Arg Ala Glu Ser Leu Asn Pro Leu Asn Phe Ser Ser Ser 
50 65 70 75 80 



Lys Pro Lys Arg Gly Val Val Thr Met Val lie Pro Phe Ser Lys Gly 
85 90 95 



Ser Ala His Glu Gin Pro Pro Pro Asp Leu Ala Ser Tyr Leu Phe Lys 
100 105 110 



Asn Arg He Val Tyr Leu Gly Met Ser Leu Val Pro Ser Val Thr Glu 
115 120 125 
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Leu lie Leu Ala Glu Phe Leu Tyr Leu Gin Tyr Glu Asp Glu Glu Lys 
130 135 140 

5 Pro lie Tyr Leu Tyr lie Asn Ser Thr Gly Thr Thr Lys Asn Gly Glu 
145 150 155 160 



•Lys Leu Gly Tyr Asp Thr Glu Ala Phe Ala He Tyr Asp Val Met Gly 
10 165 170 175 



Tyr Val Lys Pro Pro He Phe Thr Leu Cys Val Gly Asn Ala Trp Gly 
180 185 190 

15 

Glu Ala Ala Leu Leu Leu Thr Ala Gly Ala Lys Gly Asn Arg Ser Ala 
195 200 205 

20 

Leu Pro Ser Ser Thr He Met He Lys Gin Pro He Ala Arg Phe Gin 
210 215 220 

25 Gly Gin Ala Thr Asp Val Glu He Ala Arg Lys Glu He Lys His He 
225 230 235 240 



Lys Thr Glu Met Val Lys Leu Tyr Ser Lys His He Gly Lys Ser Pro 
30 245 250 255 



Glu Gin He Glu Ala Asp Met Lys Arg Pro Lys Tyr Phe Ser Pro Thr 
260 265 270 

35 

Glu Ala Val Glu Tyr Gly He He Asp Lys Val Val Tyr Asn Glu Arg 
275 280 285 

40 

Gly Ser Gin Asp Arg Gly Val Val Ser Asp Leu Lys Lys Ala Gin Leu 
290 295 300 

45 He 
305 



<210> 17 

50 

<211> 906 

<212> DNA 

55 <213> Nicotiana tabacum 



<220> 

60 

<221> CDS 

<222> (45) . . (755) 
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<223> 



5 <400> 17 

gcggccgctc caagattcat ccccaactct caacacattc aact atg cgc acc caa 56 

Met Arg Thr Gin 
1 

10 att gtt cac aaa etc ttt aac cga aga ate aac gga acc cct ttg aat 104 
lie Val His Lys Leu Phe Asn Arg Arg lie Asn Gly Thr Pro Leu Asn 
5 10 15. 20 

agt agt aag aga ttt tat ggg gta ata cca atg gta ata gag cac tct 152 
15 Ser Ser Lys Arg Phe Tyr Gly Val lie Pro Met Val He Glu His Ser 

25 30 35 

tea aga gga gaa agg get tat gac ata ttc tea agg eta tta aag gaa 200 
Ser Arg Gly Glu Arg Ala Tyr Asp He Phe Ser Arg Leu Leu Lys Glu 
20 40 45 50 

cga att att tgc att aac ggc ccc att gat gat tec act tct cat gtt 248 

Arg He He Cys He Asn Gly Pro He Asp Asp Ser Thr Ser His Val 

55 60 65 

25 

gtt gtt get cag ctt ctt ttt ctt gaa tct gag aac cct tct aag cct 296 

Val Val Ala Gin Leu Leu Phe Leu Glu Ser Glu Asn Pro Ser Lys Pro 
70 75 80 

30 att cac aag tac etc aac tct cca ggt ggc get gtt aca get ggt ctt 344 
He His Lys Tyr Leu Asn Ser Pro Gly Gly Ala Val Thr Ala Gly Leu 
85 90 95 100 

gca ate tat gat acc acg cag tat ate cga tct cca att cat act ata 392 
35 Ala He Tyr Asp Thr Thr Gin Tyr He Arg Ser Pro He His Thr He 

105 110 115 

tgc eta ggt caa gca get tea atg gga tec ctt etc tta get gca ggt 440 
Cys Leu Gly Gin Ala Ala Ser Met Gly Ser Leu Leu Leu Ala Ala Gly 
40 120 125 130 

gca aag ggt gag aga cga tct etc cct aat get tea gtt atg att cac 488 

Ala Lys Gly Glu Arg Arg Ser Leu Pro Asn Ala Ser Val Met He His 
135 140 145 

45 

cag cct ttc ggt ggg tat age ggg cag get aaa gat ttg acg ate cac 536 

Gin Pro Phe Gly Gly Tyr Ser Gly Gin Ala Lys Asp Leu Thr He His 
150 155 160 

50 aca aaa cag ata gtt egg gta tgg gat act ttg aat gac eta tat gca 584 
Thr Lys Gin He Val Arg Val Trp Asp Thr Leu Asn Asp Leu Tyr Ala 
165 170 175 180 

aag cat aca gga caa cct ata gaa ata att caa aag aat atg gat agg 632 
55 Lys His Thr Gly Gin Pro He Glu He He Gin Lys Asn Met Asp Arg 

185 190 195 

gat tat ttc atg aca cct gaa gag gcg aag gag ttt gga ata ate gat 680 
Asp Tyr Phe Met Thr Pro Glu Glu Ala Lys Glu Phe Gly He He Asp 
60 200 205 210 



gaa gtt ata gat gaa cga cca atg get tta gta act gat get gtt gca 
Glu Val lie Asp Glu Arg Pro Met Ala Leu Val Thr Asp Ala Val Ala 
215 220 225 



728 
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aat gaa gcc aaa gaa aaa ggt tea age tagaaaaatt gctgtaatac 775 
Asn Glu Ala Lys Glu Lys Gly Ser Ser 
230 235 

5 

tgatctcatt gcagtctttg ttagcattta ccatcgctaa ctagttctcc attttactta 835 
ctggtgtatt tactttctag tattttattt gatgaggega tacctcatta ctttgttttc 895 
10 tcagcggccg c 90e 

<210> 18 

15 <211> 237 

<212> PRT 

<213> Nicotiana tabacum 

20 

<400> 18 

25 Met Arg Thr Gin lie Val His Lys Leu Phe Asn Arg Arg He Asn Gly 
1 5 10 15 

Thr Pro Leu Asn Ser Ser Lys Arg Phe Tyr Gly Val He Pro Met Val 
30 20 25 30 

He Glu His Ser Ser Arg Gly Glu Arg Ala Tyr Asp He Phe Ser Arg 
35 4 0 * 45 

35 

Leu Leu Lys Glu Arg He He Cys He Asn Gly Pro He Asp Asp Ser 
50 55 60 

40 

Thr Ser His Val Val Val Ala Gin Leu Leu Phe Leu Glu Ser Glu Asn 
65 70 75 80 

45 Pro Ser Lys Pro He His Lys Tyr Leu Asn Ser Pro Gly Gly Ala Val 

85 90 95 

Thr Ala Gly Leu Ala He Tyr Asp Thr Thr Gin Tyr He Arg Ser Pro 
50 100 105 110 

He His Thr He Cys Leu Gly Gin Ala Ala Ser Met Gly Ser Leu Leu 
115 120 125 

55 

Leu Ala Ala Gly Ala Lys Gly Glu Arg Arg Ser Leu Pro Asn Ala Ser 
130 135 140 

60 

Val Met He His Gin Pro Phe Gly Gly Tyr Ser Gly Gin Ala Lys Asp 
145 150 155 160 
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Leu Thr lie His Thr Lys Gin He Val Arg Val Trp Asp Thr Leu Asn 
165 170 175 

5 Asp Leu Tyx Ala Lys His Thr Gly Gin Pro He Glu He He Gin Lys 
180 185 190 

Asn Met Asp Arg Asp Tyr Phe Met Thr Pro Glu Glu Ala Lys Glu Phe 
10 195 200 205 



15 



20 



45 



50 



Gly He He Asp Glu Val He Asp Glu Arg Pro Met Ala Leu Val Thr 
210 215 220 



Asp Ala Val Ala Asn Glu Ala Lys Glu Lys Gly Ser Ser 
225 230 ~ 235 



<210> 19 
<211> 447 
25 <212> DNA 

<213> Nicotiana tabacutn 

30 

.<400> 19 ~ 

gcggccgctt gcggacaaga taatcgattc aagagacaat gtatttgaga aaaggaacta 60 
tgatgagata ctcgcccaat ctagagctat gaggaaagcc ggaccaggtg ctcaggctgc 120 

■ 35 

tccatctggc ttcaggtgac tggaagagcg gtaatggtcc caaactttca ggaacaactg 180 
ttgttccctt atagtttcga ggaacaaagt tgctggttac ttggtctgtg ccggtataat 240 
40 gtaactggga caaagaacat attgtagaaa ccttgtttga gctgtgaagt ataggggttt 300 
tacaactatt atgcacaggt ctgcaaagag tacccataat gtcaattggt tgtaccagta 360 
tcaaacaatc agatagtgcc agtgtatggt ataaatgaat atagatctct ctgatgtcat 420 
ttttctttta tcatgttcag cggccgc 447 



<210> 20 

<211> 996 

<212> DNA 

55 <213> Nicotiana tabacum 

<400> 20 

60 gcggccgctt gcggacaaga taatcgattc aagagacaat gtatttgaga aaaggaacta 60 

tgatgagata ctcgcccaat ctagagctat gaggaaagcc ggaccaggtg ctcaggctgc 120 

tccatctggc ttcaggtgac tggaagagcg gtaatggtcc caaactttca ggaacaactg 180 
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ttgttccctt atagtttcga ggaacaaagt tgctggttac ttggtctgtg ccggtataat 240 
^ gtaactggga caaagaacat attgtagaaa ccttgtttga gctgtgaagt ataggggttt 300 
tacaactatt atgcacaggt ctgcaaagag tacccataat gtcaattggt tgtaccaggc 360 
ggccgctggc ttcttctttg cttctctctc cgctttctag ctcgacggtt actgaaaatc 420 
gcgagctggg ttctggtaaa tcaactttca tatccagtcc caatttctcc tttgcaactt 480 
ctgttcacag ttgcaggcca aacggcgttc gaggttattg ttacaggtct ccggtagcta 540 
agtctttgga ccatataccc caaaaattca gactggaaaa tctcaaagat ggactactgg 600 
acaactataa aagtgcccct cagtatcttt acggccttag tccttcacag atggatatgt 660 
tcatgacaga agataaccca gtacggcgac agtcagaaag tgccactgag gatagtatat 72 0 
ctgcggccgc tggcagatgc tccacgaarg gataccacag caggtcttgg ttagtccata 780 
cacatcgtat aatttatggc tgatagtggt tgtacgactt gcagtgttat tttgcaattt 840 
cttttgttta atctacatat tgaactcttt tgatctactt attcaaaaac atgaaatcct 900 
gagcagacta gatgcatttg tttaatatca tgaatgcaag gaatccacct acagctgata 960 
tgtatacaaa gatacctttt tttcaagagc ggccgc 996 

30 

<210> 21 
<211> 602 
35 <212> DNA 

<213> Nicotiana tabacum 

40 

<220> 

<221> CDS 
45 <222> (2).. (193) 
<223> 



10 



15 



20 



25 



50 



55 



60 



cgt cat aaa ate gac aag atg tat gtc gcc ttt act gac caa cca att 
Arg His Lys lie Asp Lys Met Tyr Val Ala Phe Thr Asp Gin Pro lie 
20 25 30 



<400> 21 

g egg ccg ctg gaa gat gtg egg cgc caa gtg aac gaa gcg gtt caa cct 49 
Arg Pro Leu Glu Asp Val Arg Arg Gin Val Asn Glu Ala Val Gin Pro 
1 5 10 15 



97 



gag aag gtg caa cag tac act gaa agg gat cgt ttt ttg tct gtc tea 145 
Glu Lys Val Gin Gin Tyr Thr Glu Arg Asp Arg Phe Leu Ser Val Ser 
35 40 45 

gag gcc atg gag ttt ggt etc ata gat ggg gtg eta gaa aca gaa tac 193 
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30 

Glu Ala Met Glu Phe Gly Leu lie Asp Gly Val Leu Glu Thr Glu Tyr 
50 55 60 

tagttgcaaa tgaatcttta gtagtacatg gtagctagcc ttccaatgac gaaaaagctg 253 

5 

gtgttgctca ttaaccactt cgaagtacaa gaagctggct cttgcaaatt tgtatcgtag 313 

aaatatctca actcttcaat ccaggaatgt ccaaaagcct aattctgaag acggttatag 373 

10 aaagcgctct tgttttacta tttttgtctc tcctgcagat acactcagca cttttgtggg 433 

tattaatcag ggtcttaatt catcacttaa tcacaatcca gttggaagcg aagtgatcaa 493 

acacaaagca gattcaggaa gatgtgtatt tttcccaaat atatattact ccaattgcta 553 

tcatcccttc gctgtcgtta tgaaaggata tttattttat agcggccgc 602 



15 



20 



<210> 22 

<211> 64 

<212> PRT 

25 <213> Nicotiana tabacum 



30 



45 



50 



55 



60 



<400> 22 

Arg Pro Leu Glu Asp Val Arg Arg Gin Val Asn Glu Ala Val Gin Pro 
15 10 15 



35 Arg His Lys lie Asp Lys Met Tyr Val Ala Phe Thr Asp Gin Pro lie 
20 25 30 

Glu Lys Val Gin Gin Tyr Thr Glu Arg Asp Arg Phe Leu Ser Val Ser 
40 35 40 45 



Glu Ala Met Glu Phe Gly Leu lie Asp Gly Val Leu Glu Thr Glu Tyr 
50 55 60 

<210> 23 

<211> 16 

<212> DNA 

<213> artificial sequence 
<220> 

<223> primer 

<400> 23 

agaattcgcg gccgct 16 
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<210> 24 
<211> 32 
5 <212> DNA 

<213> artificial sequence 



10 



20 



30 



35 



40 



45 



50 



<220> 

<223> primer 



15 <400> 24 

ctcatgcggc cgcgcgcaac gcaattaatg tg 32 



<210> 25 

<211> 32 

<212> DNA 

25 <213> artificial sequence 



<220> 

<223> primer 

<400> 25 

tcatgcggcc gcgagatcca gttcgatgta ac 32 

<210> 26 

<211> 21 

<212> DNA 

<213> artificial sequence 



<220> 

<223> primer 
<400> 26 

gtggattgat gtgatatctc c 21 



55 <210> 27 

<211> 21 

<212> DNA 

<213> artificial sequence 



60 



WO 2005/054283 PCT/EP2004/013555 

32 

<220> 

<223> primer 
' <400> 27 

gtaaggatct gagctacaca t 21 

<210> 28 

<211> 24 

<212> DNA 

15 <213> artificial sequence 



10 



20 



25 



30 



35 



40 



50 



<220> 

<223> primer 

<400> 28 

tataccatgg atttgccatc tttg 24 

<210> 29 

<211> 21 

<212> DNA 

<213> artificial sequence 



<220> 

<223> primer 
<400> 29 

atagatctca cctggagcca g 21 



45 <210> 30 

<211> 19 

<212> DNA 

<213> artificial sequence 



55 <220> 

<223> primer 
<400> 30 

60 gagcccatgg caagaggag 19 



<210> 31 



WO 2005/054283 PCT/EP2004/0 13555 

33 

<211> 22 
<212> DNA 
5 <213> artificial sequence 



10 



15 



20 



25 



30 



40 



<220> 

<223> primer 

<400> 31 

atagatcttt ctagcttgaa cc 22 

<210> 32 

<211> 21 

<212> DNA 

<213> artificial sequence 



<220> 

<223> primer 
<400> 32 

tcagccatgg cccctggagg a 21 



35 <210> 33 

<211> 24 

<212> DNA 

<213> artificial sequence 



45 <220> 

<223> primer 
<400> 33 

50 taagatcttc agtattctgt ttcc 24 



